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This paper provides parametric and rank-based optimal tests for 
eigenvectors and eigenvalues of covariance or scatter matrices in el- 
liptical families. The parametric tests extend the Gaussian likelihood 
ratio tests of Anderson (1963) and their pseudo- Gaussian robustifi- 
cations by Davis (1977) and Tyler (1981, 1983). The rank-based tests 
address a much broader class of problems, where covariance matrices 
need not exist and principal components are associated with more 
general scatter matrices. The proposed tests are shown to outper- 
form daily practice both from the point of view of validity as from 
the point of view of efficiency. This is achieved by utilizing the Le Cam 
theory of locally asymptotically normal experiments, in the nonstan- 
dard context, however, of a curved parametrization. The results we 
derive for curved experiments are of independent interest, and likely 
to apply in other contexts. 

1. Introduction. This fairly detailed introduction aims at providing a 
comprehensive and nontechnical overview of the paper, including its asymp- 
totic theory aspects, and a rough description of some of the rank-based test 
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statistics to be derived. It is expected to be accessible to a broad reader- 
ship. It should be sufficiently informative for the reader not interested in the 
technical aspects of asymptotic theory, to proceed to Sections 5 (Gaussian 
and pseudo-Gaussian tests) and 6 (rank-based tests), where the proposed 
testing procedures are described, and for the reader mainly interested in 
asymptotics, to decide whether he/she is interested in the treatment of a 
LAN family with curved parametrization developed in Sections 3 and 4. 

1.1. Hypothesis testing for principal components. Principal components 
are probably the most popular and widely used device in the traditional 
multivariate analysis toolkit. Introduced by Pearson (1901), principal com- 
ponent analysis (PC A) was rediscovered by Hotelling (1933), and ever since 
has been an essential part of daily statistical practice, basically in all do- 
mains of application. 

The general objective of PCA is to reduce the dimension of some ob- 
served /c-dimensional random vector X while preserving most of its total 
variability. This is achieved by considering an adequate number q of lin- 
ear combinations of the form ^X, . . . ,/3^X, where (3j, j = 1, . . . , k, are the 
eigenvectors associated with the eigenvalues Ai,...,Ajt of X's covariance 
matrix S COV ; ranked in decreasing order of magnitude. Writing (3 for the 
orthogonal k x k matrix with columns /3 1 ,...,/3 fc and As cov for the diag- 
onal matrix of eigenvalues Ai,...,Afc, the matrix S cov thus factorizes into 
S C ov = /9As cov /3'. The random variable /3jX, with variance Xj, is known as 
X's jth principal component. 

Chapters on inference for eigenvectors and eigenvalues can be found in 
most textbooks on multivariate analysis, and mainly cover Gaussian maxi- 
mum likelihood estimation (MLE) and the corresponding Wald and Gaus- 
sian likelihood ratio tests (LRT). The MLEs of /3 and As cov are the eigen- 
vectors and eigenvalues of the empirical covariance matrix 



while testing problems classically include testing for sphericity (equality of 
eigenvalues), testing for sub sphericity (equality among some given subset of 
eigenvalues — typically, the last k — q ones), testing that the £th eigenvector 
has some specified direction, or that the proportion of variance accounted for 
by the last k — q principal components is larger than some fixed proportion 
of the total variance: see, for instance, Anderson (2003) or Jolliffe (1986). 

Gaussian MLEs and the corresponding tests (Wald or likelihood ratio 
tests — since they are asymptotically equivalent, in the sequel we indistinctly 
refer to LRTs) for covariance matrices and functions thereof are notoriously 
sensitive to violations of Gaussian assumptions; see Muirhead and Water- 
naux (1980) for a classical discussion of this fact, or Yanagihara, Tonda and 
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Matsumoto (2005) for a more recent overview. The problems just mentioned 
about the eigenvectors and eigenvalues of S cov are no exception to that rule, 
although belonging, in Muirhead and Waternaux's terminology, to the class 
of "easily robustifiable" ones. For such problems, adjusted LRTs remaining 
valid under the whole class of elliptical distributions with finite fourth-order 
moments can be obtained via a correction factor involving estimated kur- 
tosis coefficients [see Shapiro and Browne (1987) for a general result on the 
"easy" cases, and Hallin and Paindaveine (2008b) for the "harder" ones]. 
Such adjusted LRTs were obtained by Tyler (1981, 1983) for eigenvector 
problems and by Davis (1977) for eigenvalues. 

Tyler actually constructs tests for the scatter matrix X! characterizing the 
density contours [of the form (x — 0)'X -1 (x — 6) = constant] of an elliptical 
family. His tests are the Wald tests associated with any available estimator 
S of 5] such that n 1//2 vec(5] — S) is asymptotically normal, with mean zero 
and covariance matrix t , say, under /G J, where J- denotes some class 
of elliptical densities and \I/ f either is known or (still, under / £ T) can be 
estimated consistently. The resulting tests then are valid under the class J-. 
When the estimator S is the empirical covariance matrix S( n ), these tests 
under Gaussian densities are asymptotically equivalent to Gaussian LRTs. 
Unlike the latter, however, they remain (asymptotically) valid under the 
class J- 4 of all elliptical distributions with finite moments of order four, and 
hence qualify as pseudo-Gaussian versions of the Gaussian LRTs. 

Due to their importance for applications, throughout this paper, we con- 
centrate on the following two problems: 

(a) testing the null hypothesis Hq that the first principal direction (3 1 
coincides (up to the sign) with some specified unit vector /3° (the choice of 
the first principal direction here is completely arbitrary, and made for the 
simplicity of exposition only), and 

(b) testing the null hypothesis %q that Ylj=q+i^j/Ylj=i^j = P against 

the one-sided alternative under which Ylj=q+i ^j/Ylj=i -\? < P, P £ (0,1) 
given. 

The Gaussian LRT for (a) was introduced in a seminal paper by Anderson 
(1963). Denoting by Xj-s and Pj-s^ 3 = lj ■ ■ ■ >k, respectively, the eigenvalues 

and eigenvectors of S^, this test — denote it by (^Anderson — re j ec t s %o 
asymptotic level a) as soon as 

Qiterso* ■= niX^'iS^y 1 ? + X&PsMffi - 2] 

(1-1) 

_ _n_ v- (A j;S - Ai ;S ) 2 , (n) 0] 2 

exceeds the a upper-quantile of the chi-square distribution with (k — 1) 
degrees of freedom. The behavior of this test being particularly poor un- 
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der non-Gaussian densities, Tyler (1981, 1983) proposed a pseudo-Gaussian 
version <^g! Tyler' wmcri he obtains via an empirical kurtosis correction 

( 1 - 2 ) ^Tyler := (* + ^) <9 Anderson 

of (1.1) (same asymptotic distribution), where k^ n ' is some consistent estima- 
tor of the underlying kurtosis parameter Section 5.3 for a definition. 

A related test of Schott (1991) addresses the same problem where however 
fli is the first eigenvector of the correlation matrix. 

The traditional Gaussian test for problem (b) was introduced in the same 
paper by Anderson (1963). For any k x k diagonal matrix A with diag- 
onal entries Ai, . . . , X k , let a M (A) := 2(p 2 £J =1 X 2 + (1 - p) 2 Ej= q +i A ])- 

Defining (3 S := (/3 1;S , . . . ,/3 k . s ) and c Ihq := (-pl' q : (1 - p)l' k _ q )', with l e := 
(1, . . . , 1)' £ R , and denoting by dvec(A) the vector obtained by stacking 

the diagonal elements of a square matrix A, Anderson's test, 4>^a- Anderson > 
say, rejects the null hypothesis at asymptotic level a whenever 

. . Person ^^ 1/2 (« M (As))" 1/2 <, g dvec(/3' s SW/3 s ) 

k 

^(op.gCAsjr^ui-p) E a ^-pE a 

V j=q+l 



n 




is less than the standard normal a-quantile. Although he does not provide 
any explicit form, Davis (1977) briefly explains how to derive the pseudo- 
Gaussian version 

V 1 -^ J Davis •— K 1 + K I J Anderson 

of (1.3), where again is any consistent estimator of the underlying kur- 
tosis parameter n k - The resulting test (same asymptotic standard normal 

distribution) will be denoted as <^A-E>avis- 

Being based on empirical covariances, though, the pseudo-Gaussian tests 
based on (1.2) and (1.4) unfortunately remain poorly robust. They still are 
very sensitive to the presence of outliers — an issue which we do not touch 
here; see, for example, Croux and Haesbroeck (2000), Salibian-Barrera, Van 
Aelst and Willems (2006), and the references therein. Moreover, they do 
require finite moments of order four — hence lose their validity under heavy 
tails, and only address the traditional covariance-based concept of principal 
components. 

This limitation is quite regrettable, as principal components, irrespec- 
tive of any moment conditions, clearly depend on the elliptical geometry 
of underlying distributions only. Recall that an elliptical density over M. k is 
determined by a location vector E M fc , a scale parameter a G Rq" (where 
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a 2 is not necessarily a variance) , a real- valued k x k symmetric and positive 
definite matrix V called the shape matrix, and a standardized radial den- 
sity fi (whenever the elliptical density has finite second-order moments, the 
shape and covariance matrices V and S C ov ar e proportional, hence share 
the same collection of eigenvectors and, up to a positive factor, the same 
collection of eigenvalues). Although traditionally described in terms of the 
covariance matrix S C ovj most inference problems in multivariate analysis 
naturally extend to arbitrary elliptical models, with the shape matrix V or 
the scatter matrix X! := cr 2 V playing the role of S C ov Principal components 
are no exception; in particular, problems (a) and (b) indifferently can be for- 
mulated in terms of shape or covariance eigenvectors and eigenvalues. Below, 
Ay := diag(Ai ; v, • • • , Afc ; v) and (3 := ((3 1 , . . . , (3 k ) collect the eigenvalues and 
eigenvectors of the shape matrix V. 

Our objective in this paper is to provide a class of signed-rank tests which 
remain valid under arbitrary elliptical densities, in the absence of any mo- 
ment assumption, and hence are not limited to the traditional covariance- 
based concept of principal components. Of particular interest within that 
class are the van der Waerden — that is, normal-score — tests, which are 
asymptotically equivalent, under Gaussian densities, to the corresponding 
Gaussian LRTs (the asymptotic optimality of which we moreover establish 
in Section 5, along with local powers). Under non-Gaussian conditions, how- 
ever, these van der Waerden tests uniformly dominate, in the Pitman sense, 
the pseudo-Gaussian tests based on (1.2) and (1.4) above, which, as a result, 
turn out to be nonadmissible (see Section 7). 

Our tests are based on the multivariate signs and ranks previously con- 
sidered by Hallin and Paindaveine (2006a, 2008a) and Hallin, Oja and Pain- 
daveine (2006). Denote by Xi, . . . ,X n an observed n-tuple of fc-dimensional 
elliptical vectors with location 6 and shape V. Let Z,; := V _1 ' 2 (Xj — 0) de- 
note the sphericized version of Xj (throughout A 1//2 , for a symmetric and 
positive definite matrix A, stands for the symmetric and positive definite 
root of A): the corresponding multivariate signs are defined as the unit vec- 
tors Uj = Ui(0, V) := Z;/||Z;||, while the ranks i?J n) = R\ n) {0, V) are those 
of the norms ||Zj||, i = l,...,n. Our rank tests are based on signed-rank 
covariance matrices of the form 

n \ n + 1 / 

i=i \ 1 / 

where K : (0, 1) — > M stands for some score function, and Uj = Uj(0, V) and 
are computed from appropriate estimators 6 and V of 6 
and V. More precisely, for the testing problem (a), the rank-based test 
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rejects the null hypothesis Hq (at asymptotic level a) whenever 

j— 2 

exceeds the a upper-quantile of the chi-square distribution with (k — 1) 
degrees of freedom; here, Jk{K) is a standardizing constant and 0j stands 
for a constrained estimator of V's jth eigenvector; see (5.2) for details. As 

for problem (b), our rank tests 4>^! K are based on statistics of the form 

TP := (^^) 1/ \a P .(Av))-/ 2 c;, g dvec(AV 2 ^S^MV 2 ), 

where Av and (3 are adequate estimators of Ay and f3, respectively. The 
null hypothesis T~Lq is to be rejected at asymptotic level a whenever Tp is 

smaller than the standard normal a-quantile. 

These tests are not just validity-robust, they also are efficient. For any 
smooth radial density /i, indeed, the score function K = Kf r (see Sec- 
tion 2.2) provides a signed-rank test which is locally and asymptotically opti- 
mal (locally and asymptotically most stringent, in the Le Cam sense) under 
radial density f±. In particular, when based on normal or van der Waer- 
den scores K = K$ x := ^ , where denotes the chi-square distribution 
function with k degrees of freedom, our rank tests achieve the same asymp- 
totic performances as the optimal Gaussian ones at the multinomial, while 
enjoying maximal validity robustness, since no assumption is required on 
the underlying density beyond ellipticity. Moreover, the asymptotic relative 
efficiencies (AREs) under non-Gaussian densities of these van der Waerden 
tests are uniformly larger than one with respect to their pseudo-Gaussian 
parametric competitors; see Section 7. On all counts, validity, robustness, 
and efficiency, our van der Waerden tests thus perform uniformly better than 
the daily practice Anderson tests and their pseudo-Gaussian extensions. 

1.2. Local asymptotic normality for principal components. The method- 
ological tool we are using throughout is Le Cam's theory of locally asymp- 
totically normal (LAN) experiments [for background reading on LAN, we 
refer to Le Cam (1986), Le Cam and Yang (2000) or van der Vaart (1998); 
see also Strasser (1985) or Rieder (1994)]. Although this powerful method 
has been used quite successfully in inference problems for elliptical families 
[Hallin and Paindaveine (2002, 2004, 2005, 2006a), Hallin, Oja and Pain- 
daveine (2006) and Hallin and Paindaveine (2008a) for location, VARMA 
dependence, linear models, shape and scatter, resp.], it has not been con- 
sidered so far in problems involving eigenvectors and eigenvalues, and, as a 
result, little is known about optimality issues in that context. The main rea- 
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son, probably, is that the eigenvectors (3 and eigenvalues A are complicated 
functions of the covariance or scatter matrix X, with unpleasant identifi- 
cation problems at possibly multiple eigenvalues. These special features of 
eigenvectors and eigenvalues, as we shall see, make the LAN approach more 
involved than in standard cases. 

LAN (actually, ULAN) has been established, under appropriate regularity 
assumptions on radial densities, in Hallin and Paindaveine (2006a), for ellip- 
tical families when parametrized by a location vector 9 and a scatter matrix 
S [more precisely, the vector vech(5]) resulting from stacking the upper di- 
agonal elements of S]. Recall, however, that LAN or ULAN are properties 
of the parametrization of a family of distributions, not of the family itself. 
Now, due to the complicated relation between (0,vechXl) and the quanti- 
ties of interest A and (3, the (6, vech(S))-parametrization is not convenient 
in the present context. Another parametrization, involving location, scale, 
and shape eigenvalues and eigenvectors is much preferable, as the hypothe- 
ses to be tested then take simple forms. Therefore, we show (Lemma A.l) 
how the ULAN result of Hallin and Paindaveine (2006a) carries over to this 
new parametrization where, moreover, the information matrix, very conve- 
niently, happens to be block-diagonal — a structure that greatly simplifies 
inference in the presence of nuisance parameters. Unfortunately, this new 
parametrization, where f3 ranges over the set SOk of k x k real orthogo- 
nal matrices with determinant one, raises problems of another nature. The 
subparameter vec(/3) indeed ranges over vec(50fc), a nonlinear manifold 
of M. k , yielding a curved ULAN experiment. By a curved experiment, we 
mean a parametric model indexed by a ^-dimensional parameter ranging 
over some nonlinear manifold of R , such as in curved exponential families, 
for instance. Under a vec(/3)-parametrization, the local experiments are not 
the traditional Gaussian shifts anymore, but curved Gaussian location ones, 
that is, Gaussian location models under which the mean of a multinormal 
observation with specified covariance structure ranges over a nonlinear man- 
ifold of R , so that the simple local asymptotic optimality results associated 
with local Gaussian shifts no longer hold. To the best of our knowledge, such 
experiments never have been considered in the LAN literature. 

A third parametrization, however, can be constructed from the fact that (3 
is in SOk if it can be expressed as the exponential of a k x k skew-symmetric 
matrix l. Denoting by vech + (t) the vector resulting from stacking the upper 
off-diagonal elements of t, this yields a parametrization involving location, 
scale, shape eigenvalues and vech + (t); the latter subparameter ranges freely 
over M fc ( fc-1 )/ 2 , yielding a well-behaved ULAN parametrization where local 
experiments converge to the classical Gaussian shifts, thereby allowing for 
the classical construction [Le Cam (1986), Section 11.9] of locally asymptot- 
ically optimal tests. The trouble is that translating null hypotheses (a) and 
(b) into the t-space in practice seems unfeasible. 
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Three distinct ULAN structures are thus coexisting on the same families 
of distributions: 

(ULAN1) proved in Hallin and Paindaveine (2006a) for the (0, vech(X))- 
parametrization, serving as the mother of all subsequent ones; 

(ULAN2) for the location-scale-eigenvalues-eigenvectors parametrization, 
where the null hypotheses of interest take simple forms, but the local exper- 
iments happen to be curved ones; 

(ULAN3) for the location-scale-eigenvalues-skew symmetric matrix param- 
etrization, where everything is fine from a decision-theoretical point of view, 
with, however, the major inconvenience that explicit solutions cannot be 
obtained in terms of original parameters. 

The main challenge of this paper was the delicate interplay between these 
three structures. Basically, we are showing (Lemma A.l) how ULAN can be 
imported from the first parametrization, and (Section 3.3) optimality results 
from the third parametrization, both to the second one. These results then 
are used in order to derive locally asymptotically optimal Gaussian, pseudo- 
Gaussian and rank-based tests for eigenvectors and eigenvalues of shape. 
This treatment we are giving of curved ULAN experiments, to the best of 
our knowledge, is original, and likely to apply in a variety of other contexts. 

1.3. Outline of the paper. Section 2 contains, for easy reference, some 
basic notation and fundamental assumptions to be used later on. The main 
ULAN result, of a nonstandard curved nature, is established in Section 3, 
and its consequences for testing developed in Section 4. As explained in the 
Introduction, optimality is imported from an untractable parametrization 
involving skew-symmetric matrices. This is elaborated, in some detail, in 
Section 3.3, where a general result is derived, and in (Section 4.1), where 
that result is applied to the particular case of eigenvectors and (Section 4.2) 
eigenvalues of shape, under arbitrary radial density f\. Special attention 
is given, in Sections 5.1 and 5.2, to the Gaussian case (/i = 4>i); in Sec- 
tions 5.3 and 5.4, those Gaussian tests are extended to a pseudo-Gaussian 
context with finite fourth-order moments. Then, in Section 6, rank-based 
procedures, which do not require any moment assumptions, are constructed: 
Section 6.1 provides a general asymptotic representation result [Proposi- 
tion 6. 1 (i)] in the Hajek style; asymptotic normality, under the null as well 
as under local alternatives, follows as a corollary [Proposition 6.1(H)]. Based 
on these results, Sections 6.2 and 6.3 provide optimal rank-based tests for the 
eigenvector and eigenvalue problems considered throughout; Sections 7 and 
8 conclude with asymptotic relative efficiencies and simulations. Technical 
proofs are concentrated in the Appendix. 

The reader interested in inferential results and principal components only 
(the form of the tests, their optimality properties and local powers) may 
skip Sections 3 and 4, which are devoted to curved LAN experiments, and 
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concentrate on Section 5 for the "parametric" procedures, on Section 6 for 
the rank-based ones, on Sections 7 and 8 for their asymptotic and finite- 
sample performances. 

1.4. Notation. The following notation will be used throughout. For any 
k x k matrix A = (A{j), write vec(A) for the /c 2 -dimensional vector obtained 
by stacking the columns of A, vech(A) for the [k(k + l)/2]-dimensional vec- 
tor obtained by stacking the upper diagonal elements of those columns, 
vech + (A) for the [k(k — l)/2] -dimensional vector obtained by stacking the 
upper off-diagonal elements of the same, and dvec(A) =: (An, (dvec(A))')' 
for the fe-dimensional vector obtained by stacking the diagonal elements 
of A;dvec(A) thus is dvec(A) deprived of its first component. Let 
be the k x k 2 matrix such that Hfcvec(A) = dvec(A). Note that we then 
have that H^,dvec(A) = vec(A) for any k x k diagonal matrix A, which 
implies that H^HJ-, = Write diag(Bi, . . . , B m ) for the block-diagonal ma- 
trix with blocks Bi,...,B m and A® 2 for the Kronecker product A® A. 
Finally, denoting by the Ith. vector in the canonical basis of M. k , write 



2. Main assumptions. 

2.1. Elliptical densities. We throughout assume that the observations 
are elliptically symmetric. More precisely, defining 

F:={h:B$->R + :ii k - 1]h <oo}, 

where [i^h '■= r e h(r) dr, and 



we denote by , ...,X„ an observed n-tuple of mutually independent 
/c-dimensional random vectors with probability density function of the form 



(2.1) /(x) := c^jSr^MCx - 0)'S- 1 (x - 0)) 1/2 ), x G R k , 



for some fc-dimensional vector (location), some symmetric and positive 
definite (k x k) scatter matrix S, and some /i in the class T\ of standardized 
radial densities; throughout, |A| stands for the determinant of the square 
matrix A. 

Define the elliptical coordinates of as 





(2.2) 



ufV,s) 



S-V2(xj n) -fl) 

E -1/2(X(")-0)|| 



and 



E -1/2( X M_0)||. 
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Under the assumption of ellipticity, the multivariate signs 

i = 1, . . . ,n, are i.i.d. uniform over the unit sphere in M fc , and independent 

of the standardized elliptical distances df l \o,'S). Imposing that f\ £ F\ 
implies that the d i n \6,'S) , s, which have common density fik( r ) '■= 
(t l k-l;fi)~ 1 r 1 fi( r )I[r>o] ) with distribution function Fi k , have median one 
[-Fifc(l) = 1/2] — a constraint which identifies 5] without requiring any mo- 
ment assumptions [see Hallin and Paindaveine (2006a) for a discussion]. 
Under finite second-order moments, the scatter matrix S is proportional to 
the traditional covariance matrix S cov . 

Special instances are the fc-variate multinormal distribution, with radial 
density fi(r) = (/>i(r) : = exp(— a k r 2 /2), the fc-variate Student distributions, 
with radial densities (for v £ Rq" degrees of freedom) fx(r) = /| v {r) := (1 + 

a k,u r ' 2 : /i , )~( k+u ^' 2 ', and the /c-variate power-exponential distributions, with 
radial densities of the form f\{r) = ff v (r) := exp(— b k ^r 2ri ), r/ £ Kj; the 
positive constants a k , a kjU , and b k)Tj are such that f\ EJ-\. 

The derivation of locally and asymptotically optimal tests at standardized 
radial density f\ will be based on the uniform local and asymptotic normality 
(ULAN) of the model at given f±. This ULAN property — the statement of 
which requires some further preparation and is delayed to Section 3 — only 
holds under some further mild regularity conditions on f\. More precisely, 
we require fi to belong to the collection F a of all absolutely continuous 
densities in T\ for which, denoting by f\ the a.e. derivative of f\ and letting 
(p fl := the integrals 

(2.3) l k (h):= [\ 2 fi (r)f lk (r)dr and J k (h) := f r\\ (r) f lk (r) dr 
Jo Jo 

are finite. The quantities I k (fi) and J k {fi) play the roles of radial Fisher in- 
formation for location and radial Fisher information for shape/scale, respec- 
tively. Slightly less stringent assumptions, involving derivatives in the sense 
of distributions, can be found in Hallin and Paindaveine (2006a), where we 
refer to for details. The intersection of F a and Ff := {/i £ F\ : J °° r A f\ k {r) dr < 
oo} will be denoted as F A . 

2.2. Score functions. The various score functions K appearing in the 
rank-based statistics to be introduced in Section 6 will be assumed to satisfy 
a few regularity assumptions which we are listing here for convenience. 

Assumption (S). The score function K : (0, 1) — )■ K (SI) is continuous 
and square-integrable, (S2) can be expressed as the difference of two mono- 
tone increasing functions, and (S3) satisfies J K(u) du = k. 

Assumption (S3) is a normalization constraint that is automatically sat- 
isfied by the score functions K(u) = Kf^u) := ip^^F^^u^F^^u) associ- 
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ated with any radial density f\ G F a (at which ULAN holds); see Section 3. 
For score functions K,K\,K2 satisfying Assumption (S), let [throughout, U 
stands for a random variable uniformly distributed over (0, 1)] 

(2.4) J k {K u K 2 ) -^[K^K^U)}, J k (K):=J k (K,K) 
and 

(2.5) J k (K,f 1 ):=J k (K,K fl ); 

with this notation, J k (f\) = J k {K f x ,K f x ). 

The power score functions K a (u) := k(a + l)u a (a > 0), with J k {Ka) = 
k 2 (a + l) 2 /(2a + 1), provide some traditional score functions satisfying As- 
sumption (S): the sign, Wilcoxon, and Spearman scores are obtained for 
a = 0, a = 1 and a = 2, respectively. As for the score functions of the form 
Kf x , an important particular case is that of van der Waerden or normal 
scores, obtained for f\ = <j)\. Then 

(2.6) K^{u) = ^\u) and Mfa) = k(k + 2), 

where ^f k was defined in page 6. Similarly, Student densities f\ = f\ v (with 
v degrees of freedom) yield the scores 

k(k + u)G^ u ( U ) 
K f t (u) = 



and 



t ,_ k{k + 2)(k + v) 



where G kv stands for the Fisher-Snedecor distribution function with k and 
v degrees of freedom. 

3. Uniform local asymptotic normality (ULAN) and curved Gaussian lo- 
cation local experiments. 

3.1. Semiparametric modeling of elliptical families. Consider an i.i.d. re- 
tuple X^ n \ . . . ,xi"^ with elliptical density (2.1) characterized by 0, S, and 
/l : (0, X) or, if a vector is to be preferred, (0' , (vech £)')', provides a per- 
fectly valid parametrization of the elliptical family with standardized radial 
density f\. However, in the problems we are considering in this paper, it 
will be convenient to have eigenvalues and eigenvectors appearing explic- 
itly in the vector of parameters. Decompose therefore the scatter matrix 
X into XI = <r 2 V = /3As/3' = /3o" 2 Av/9', where a G is a scale parameter 
(equivariant under multiplication by a positive constant), and V a shape 
matrix (invariant under multiplication by a positive constant) with eigen- 
values A v = diag(Ai ;V , • • • , A fc;V ) = cr -2 diag(Ai ;S , . . . , A fc;S ) = o-~ 2 A s ; (3 is 
an element of the so-called special orthogonal group SO k := {0|0'0 = I k , 
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|0| = 1} diagonalizing both XI and V, the columns /3 l5 . . . ,f3 k of which are 
the eigenvectors (common to S and V) we are interested in. 

Such decomposition of scatter into scale and shape can be achieved in 
various ways. Here, we adopt the determinant-based definition of scale 

a := \^ k = 11 Ajg* hence V := V/a 2 = ^/IS] 1 ^, 

which implies that |V| = JIj=i A?;V = 1- As shown by Paindaveine (2008), 
this choice indeed is the only one for which the information matrix for scale 
and shape is block-diagonal, which greatly simplifies inference. The para- 
metric families of elliptical distributions with specified standardized radial 
density /i then are indexed by the L = k(k + 2)-dimensional parameter 

:= (0 / ,a 2 ,(dvecA v )',(vec / 3) , ) / =: (^n^m^ivY, 

where dvec(A v ) = (^2-,v, • • • , Afc ; v)' since Ai ;V = 0^=2 ^v- 

This i9-parametrization however requires a fully identified /c-tuple of eigen- 
vectors, which places the following restriction on the eigenvalues Ay. 

Assumption (A). The eigenvalues A J; v of the shape matrix V are all 
distinct, that is, since I] (hence also V) is positive definite, Ai ; v > A2 ; v > 

■•■ > A fc;V >0. 

Denote by P^!^ the joint distribution of X^ n \ . . . , X„ under parameter 
value t? and standardized radial density f\ G T\ ; the parameter space [the 
definition of which includes Assumption (A)] then is 

:= R k x M+ x C k ~ l x vec(SO k ), 

where C k ~ 1 is the open cone of (R^)^ 1 with strictly ordered (from largest 
to smallest) coordinates. 

Since vec(SOk) is a nonlinear manifold of M fc : the vec (/^-parametri- 
zed experiments are curved experiments, in which the standard methods 
[see Section 11.9 of Le Cam (1986)] for constructing locally asymptoti- 
cally optimal tests do not apply. It is well known, however [see, e.g., Khuri 
and Good (1989)], that any element f3 of SOk can be expressed as the 
exponential exp(t) of a k x k skew- symmetric matrix t, itself character- 
ized by the k(k — l)/2- vector vech + (t) of its upper off-diagonal elements. 
The differentiable mapping /i:vech + (t) i— > fi(vech + (t)) := vec(exp(t)) from 
jg>fc(fe-i)/2 to gQ k j g one -to-one, so that vech + (t) G R^C " 1 )/ 2 also can be 
used as a parametrization instead of vec(/3) G vec(SOk)- Both parametriza- 
tions yield uniform local asymptotic normality (ULAN). Unlike the vec(/3)- 
parametrized one, the vech + (t)-parametrized experiment is not curved, as 
vech + (t) freely ranges over R fc ( fc_1 )/ 2 ) so that the standard methods for con- 
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structing locally asymptotically optimal tests apply — which is not the case 
with curved experiments. On the other hand, neither the vech + (t.)-part of 
the central sequence, nor the image in the vech + (t)-space of the null hy- 
pothesis yield tractable forms. Therefore, we rather state ULAN for the 
curved vec(/3)-parametrization. Then (Section 3.3), we develop a general 
theory of locally asymptotically optimal tests for differentiable hypotheses 
in curved ULAN experiments. 

Without Assumption (A), the i?-parametrization is not valid, and can- 
not enjoy LAN nor ULAN; optimality properties (of a local and asymptotic 
nature) then cannot be obtained. As far as validity issues (irrespective of 
optimality properties) are considered, however, this assumption can be weak- 
ened. If the null hypothesis Hq is to make any sense, the first eigenvector 
j3 1 clearly should be identifiable, but not necessarily the remaining ones. 
The following assumption on the A J; v's, under which /3 2 , ... ,(5 k need not be 
identified, is thus minimal in that case. 

Assumption (A' x ) . The eigenvalues of the shape matrix V are such that 
Ai ; v > A 2; v > > A fc;V > 0. 

Under Assumption (A^), is broadened into a larger parameter space 
&i, which does not provide a valid parametrization anymore, and for which 
the ULAN property of Proposition 3.1 below no longer holds. As we shall 
see, all the tests we are proposing for Hq nevertheless remains valid under 
the extended null hypothesis Hq'i resulting from weakening (A) into (A' x ). 
Note that, in case the null hypothesis is dealing with /3 instead of (3±, the 
appropriate weakening of Assumption (A) is the following. 

Assumption (A' ). The eigenvalues of the shape matrix V are such that 

Al;V > • • • > Ag_l;V > Ag;V > Ag + l;V > " " > Ajfc ; V > 0. 

This yields enlarged parameter space 0^ and null hypothesis Hq.'„. 

Similarly, the null hypothesis Hq requires the identifiability of the groups 
of q largest (hence k — q smallest) eigenvalues; within each group, however, 
eigenvalues may coincide, yielding the following assumption. 

Assumption (A^ ) . The eigenvalues of the shape matrix V are such that 
Ai ; v > ■ ■ > A g _i ; v > A g; v > Aq + i ; v > • ■ • > Afc ; V > 0. 

This yields enlarged parameter space 0^' and null hypothesis H^q, say. 
As we shall see, the tests we are proposing for remain valid under Hq.'^. 

3.2. Curved ULAN experiments. Uniform local asymptotic normality 

(ULAN) for the parametric families or experiments := {P^ 1 ^ G ©}, 
with classical root-n rate, is the main technical tool of this paper. For 
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any 1? := (0',a 2 , (dvec Av)', (vec/3)')' € 0, a local alternative is a sequence 
tfW G such that (tf (ri) - tf) is C^ra" 1 / 2 ). For any such consider 
a further sequence tf (n) + ?i- 1 /V n ), with rW = ((t^)',^ ), (t^W)', 

( T /y(n)yy such that supraT (n)/ T (n) < ^ and #(n) + n -l/2 T (») £ Q for all 

n. Note that such exist: t 1 ^ can be any bounded sequence of M k , 
T n { n ) ari y bounded sequence with t 11 ^ > —n l ^ 2 a 2 ^ n \ t 111 ^ any bounded 
sequence of real (k — l)-tuples (t/ , . . . ,tF^[ ) such that 

0<A^ + n-VVfW<- 
< + n-VVf W < aW + n-^rf (n) 

<n(A^ + n- 1/2 r/-i (n 'V 1 , 
i=2 

which ensures that the perturbed eigenvalues A^ + n 

-l/2/n) 

, with 



A: 



1 



(3.1) 



^=2 



i=2 



and (^2i • • • j ) :=T in ( n \ still satisfy Assumption (A) and yield deter- 
minant value one. Writing tS n ^ for the diagonal k x k matrix with diagonal 
elements £^ , . . . , , we then have 



n) n-1 



J =2 



+E(^r 1 ^" ) 

3=2 

= 0{n- 1 ' 2 ). 

Finally, denote by M fc (A 2 , . . . , A fc ) = (-A^A^ 1 ,.. . , A^ 1 )' : I fc -i)' the value 
at (A2, • • • , Afc) of the Jacobian matrix of 

(A 2 ,...,A fc )^ ^Ai :=]^[AT 1 ,A2,...,A fc ^. 
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Letting A := diag(Ai, A2, • ■ • , A&), we have M' fc (A2, • ■ • , Xk) dvec(l) = dvec(l) 
for any k x k real matrix 1 such that tr(A _1 l) = 0. Indeed, 

M' fc (A 2 , . . . , A fc ) dvec(l) = (-Ai(A 2 -\ . . . , A" 1 )' \ ^/(dvecl) 

= (-A 1 (tr(A- 1 l) - (X^-Hn) i (dvecl)')' 
= dvec(l), 

an identity that will be used later on for M^ v := M/%(dvec(Av))- 

The problem is slightly more delicate for t iv ^ , which must be such that 
vec(/3 (n) ) + n - 1/2 T Ivin) remains in vec(50 fe ). That is, T Iv ^ n) must be of the 
form t iv ^ =vec(b( n )), with 

= ((3 in) + n-WbWyipW + n-^b*")) - I k 

(3.2) 

= n-V^^W'bf") + bW/3 (n) ) + n _1 bW'bW. 

That is, /3 (n)/ b( n ) + n -i/2 b (n)' b («) / 2 should be skew-symmetric. Such local 
perturbations admit an intuitive interpretation: we have indeed 

0(n) + n -V2 b (n) = pWpW^pW + n -l/2 b (n)) = + n -l/2 / g(n)/ b (n) j 

an expression in which + n _1 / 2 /3^ n ^b^ n \ up to a 0(n _1 ) quantity, co- 
incides with the first-order approximation of the exponential of a skew- 
symmetric matrix, and therefore can be interpreted as an infinitesimal ro- 
tation. Identity (3.2) provides a characterization of SOt in the vicinity of 
/3 (n) . The tangent space [in M. , at vec(/3)] to vec(tSOfc) is obtained by lin- 
earizing (3.2). More precisely, this tangent space is of the form 

{vec(/3 + b)| vec(b) £ M fc2 and (3'b + b'/3 = 0} 

(3-3) 

= {vec(/3 + b)| vec(b) £ M and /3'b skew-symmetric}. 
We then have the following result (see the Appendix for the proof). 

Proposition 3.1. The experiment := {P^lji? £ ©} is C/X^AT, 
i/rai/i central sequence &$f t '■= (-^^'/i j ^$7i > ^tfj/i ' ^$hfi )' > w ^ ere [with di : = 
4 n) (0> v ) and U< :=uf°(0,V) as de/med in (2.2), and letting M^ v := 
M fc (dvecA v )/, 

i=l 
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A ^ := ^M-v H ,(A- 1/2 /?T 2 1 vec(^ (| ) | U^) 



and 



a/V ._ r ,/3 T /3,A\ 



mt/i Gj := (g£. 12 G£ ;13 • • • Gf. (fe _ 1)fe ), Gf a . fc := ej ® f3 h - e h ® ^ and 

Lf ' AV == (L?'i A 2 V ^i A 3 V • • • ^(tV', := (A ^ V " A ^09ft ® 

and with block- diagonal information matrix 

(3.4) =dia g (r^ ;/i ,r4 / ;/i ,r^ i ,r^ /i ), 

w/iere, de/mm<? D fc (A v ) := ±M^ v H fc [I fc2 + K^A^'H^M^)', 

r l 2fc(/l) j 7/ Jfc(/l) ~ fc 2 r /// Jfc(/l) ^ / » \ 

r *h = -toS- y ' — 4^ — ' " MfcT^) Dfc(Av) 

and 

'~ 4fc(jfe + 2) * diag ^ 12 ' 13 '•••' l/ (fc-i)fcA^fcj , 

where Ujh := A :(; vA/ l; v/(A : , ; v — ^h-y) 2 ■ More precisely, for any local alterna- 
tive and any bounded sequence such that +n~ 1 / 2 T^ £ 0, we 
/We, under , 

A (n) --loefdP (n) /dP {n) ) 

= (rW) / AW ) . /i -J(TW) / r^rW + op(l) 

and A^n).^ — >■ A/(0, r^/J, asn— >co. 

The block-diagonal structure of the information matrix implies that 

inference on /3 (resp., Ay) can be conducted under unspecified 9, a and Ay 
(resp., (3) as if the latter were known, at no asymptotic cost. The orthogo- 
nality between the eigenvalue and eigenvector parts of the central sequence 
is structural, while that between the eigenvalue and eigenvector parts on 
one hand and the scale parameter part on the other hand is entirely due to 
the determinant-based parametrization of scale [see Hallin and Paindaveine 
(2006b) or Paindaveine (2008)]. Note that T$ fl , with rank k(k - l)/2 < k 2 , 
is not invertible. 
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3.3. Locally asymptotically optimal tests for differentiable hypotheses in 
curved ULAN experiments. Before addressing testing problems involving 
eigenvalues and eigenvectors, we need a general theory for locally asymptot- 
ically optimal tests in curved ULAN experiments, which we are developing 
in this section. 

Consider a ULAN sequence of experiments {P^ £ 3}, where S is 
an open subset of M m , with central sequence A^ and information T^. For 
the simplicity of exposition, assume that for any £ has full rank m. 
Let h:S — > MP, p>m, be a continuously differentiable mapping such that 
the Jacobian matrix Dh(£) has full rank m for all £, and consider the ex- 
periments :i? £ := /i(H)}, where, with a slight abuse of notation, 
:= P^ for # = h(£). This sequence also is ULAN, with central se- 
quence A# and information matrix such that [see (A. 5) and the proof 
of Lemma A.l], at # = h(£), and [up to Op (l)'s which, for simplicity, 
we omit here] A^ = Dh'(€)A# and 1^ = Dh'(j£)r#Dh(£) — throughout, we 
write Dh'(-), Dt>'(-), etc., instead of (Dh(-))' , (£>&(•))', etc. In general, 
is a nonlinear manifold of MP; the experiment parametrized by then is a 
curved experiment. 

Next, denoting by C an r-dimensional manifold in M p , r < p, consider 
the null hypothesis T~Lq :i?£Cfl — in general, a nonlinear restriction of 
the parameter space 0. The same hypothesis can be expressed in the £- 
parametrization as Ho : £ £ Ho, where 3o := H (C H 0) is a (^-dimensional, 
say) submanifold of S. Fix £ = /i~ 1 (i9o) £ Ho, and let t: B C M^ — > H be a 
local (at £ ) chart for this manifold. 

Define olq := t~ 1 (^ ). At £ , Hq is linearized into : £ £ £ +M(Dt(ao)), 
where DT(olq) is the Jacobian matrix of t (with rank £) computed at olq and 
Ai(A) denotes the vector space spanned by the columns of a matrix A. At 
c*0) a locally asymptotically most stringent test statistic (at £ ) for T~L^ is 

(3.5) Q io := A^(r^ - Dt(ct )(Df (cto^Dtioo^Df (<*„)) A* 

[see Section 11.9 of Le Cam (1986)]. This test statistic is nothing else but the 
squared Euclidean norm of the orthogonal projection, onto the linear space 

1 /2 — 1 /2 

orthogonal to Dt(ao), of the standardized central sequence 
In view of ULAN, the asymptotic behavior of A^ Q is the same under local 
alternatives in Ho as under local alternatives in £ + A4(Dt(ao)), so that 
the same test statistic Q$ , which (at £ ) is locally asymptotically most 
stringent for , is also locally asymptotically most stringent for Hq. 

In many cases, however, it is highly desirable to express the most strin- 
gent statistic in the curved 0-parametrization, which, as is the case for 
the eigenvalues/eigenvectors problems considered in this work, is the nat- 
ural parametrization. This is the objective of the following result (see the 
Appendix for the proof). 
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Proposition 3.2. With the same notation as above, a locally asymp- 
totically most stringent statistic (at i?oJ f° r testing %q : 1? £ C n is 

(3.6) Qg = Q* := A^ (r^ o - Dd(r, )(Dh'(r, )T^Db(r, )rDb / (r l0 ))A^, 

where 6:4cl < ->l p is a local (at t9q) chart for the tangent ( still at $o ) 
to the manifold C Pi 0, r/ := 5~ 1 (i9o)> anc ^ A~ denotes the Moore-Penrose 
inverse of A. 

Hence, a locally asymptotically most stringent (at £ or i?0j depending on 
the parametrization) test for Hq can be based on either of the two quadratic 
forms Q^ Q or Q& , which coincide, and are asymptotically chi-square [(m — i) 

degrees of freedom] under = , for i?o = M£o)- P° r practical imple- 
mentation, of course, an adequately discretized root-n consistent estimator 
has to be substituted for the unknown i9o or £ — which asymptotically does 
not affect the test statistic. 

Provided that S remains an open subset of R m , the assumption of a 
full-rank information matrix is not required. Hallin and Puri [(1994), 
Lemma 5.12] indeed have shown, in the case of ARMA experiments, that (3.5) 
remains locally asymptotically most stringent provided that generalized in- 
verses (not necessarily Moore-Penrose ones) are substituted for the inverses 
of noninvertible matrices, yielding 

Q $0 := A^(rj o -Dt( ao )(Dt( ao )T^Dt( ao ))-Dt( ao ))A $0 . 

The same reasoning as in the proof of Proposition 3.2 then applies, mutatis 
mutandis, when "translating" Q^ Q into Q& (with appropriate degrees of 
freedom) . 

4. Parametrically optimal tests for principal components. 

4.1. Optimal parametric tests for eigenvectors. Testing the hypothesis 
Hq on eigenvectors is a particular case of the problem considered in the pre- 
vious section. The vech + (t) parametrization [t an arbitrary skew-symmetric 
(k x k) matrix] yields a standard ULAN experiment, with parameter 

£ := (6', a 2 , (dvec(Av))', (vech + (/,))')' Gft'x R + x C h ~ 1 x K^- 1 )/ 2 =: E, 

hence m = k(k + 3)/2, while Proposition 3.1 provides the curved ULAN 
experiment, with parameter $ £ C MP and p = k{k + 2). ULAN for the 
^-experiment readily follows from the fact that the mapping vech + (t) i— > 
vec(/3) =vec(exp(t)) is continuously differentiable. 

As explained before, the block-diagonal structure of the information ma- 
trix (3.4) implies that locally asymptotically optimal inference about (3 can 
be based on A^Y^ only, as if 6, a 2 and dvec(Av) were specified. Since this 
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also allows for simpler exposition and lighter notation, let us assume that 
these parameters take on specified values 0, a 2 and (A2 ; v> • • • , Afc ; v)> re- 
spectively. The resulting experiment then is parametrized either by vec/3 G 
vec(50/j) C M fe (playing the role of i9 G C W in the notation of Proposi- 
tion 3.2) or by vec h+(t) G R fc ( fc - 1 )/ 2 (playing the role of £). 

In this experiment, the null hypothesis 'Hq consists in the intersection of 
the linear manifold C := (/3 0/ , Oix(k-l)k)' + M. (T), where T := (0 fc ( fc „ 1 ) xfc , 
Ifc(fc-i))', with the nonlinear manifold vec(50fc). Let /3 := (/3° , (3 2 , ■ ■ ■ , (3 k ) 
be such that vec(/3 ) belongs to that intersection. In view of Proposition 3.2, 
a most stringent test statistic [at vec(/3 )] for Hq requires a chart for the 
tangent to C n vec(SO k ) at vec(/3 )- It follows from (3.3) that this tangent 
space reduces to 

{vec(/3 + b)|b := (0,b 2 , . . . ,b fc ) such that (3' b + b'(3 = 0}. 

Solving for vec(b) = (0', b' 2 , . . . , h' k )' the system of constraints (3' h+b'(3 - 

Pi 
k 

/ Ofcxfc(fc-l) \ 

k-l 

fc _! [I fe - /3°/3 '] - ]T K-fc-ie;,^ (8 /3 i+1 # +1 ] 



yields vec(b) G A4(Pf°), where 



(4.1) Pf°:= 

\ ij=i / 

(with ej.fe_i denoting the ith vector of the canonical basis of IR fc_1 ). A lo- 
cal chart for the tangent space of interest is then simply & : 77 G 1— > 
h(r/) := vec(/3 ) + P k °V, with ?7 = S^vec^)) = (fc _i )fe and Dh(r) Q ) = 

Pf°. Letting tf := (0',cr 2 , (dvec A v )', (vec/3 )')', the test statistic (3.6) takes 
the form 

(4.2) 



nfc(fc + 2) y^, ^2 



with 



(4.3) S^ ;/i := - 2^ V/i ( ) u i(°> v ) u i( fi ' 5 v ), 

i=l ^ ' 

where V denotes the unique shape value associated with the parameter 
After simple algebra, we obtain 

, . r^ ;/l [(r^ ;/i )- -pf ((pf ) / r^;/ 1 Pf )~(P? ) / ] 

(4-4) 

= 2^fc° diag(Ifc-l,0( fc _2)(fc_l)/2x(fc-2)(fe-l)/2)(G fc °) / , 
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which is idempotent with rank (£; — 1). Since, moreover, A^.^ , under P^.^ > 
is asymptotically Af(0, T^.^), Theorem 9.2.1 in Rao and Mitra (1971) then 

shows that Q^-fc , still under P^.^ , is asymptotically chi-square with {k — 1) 
degrees of freedom. 

The resulting test, which rejects Hq at asymptotic level a whenever Q^.^ 
exceeds the a-upper quantile xl-i i-a °f * ne Xk-i distribution, will be de- 
noted as ^B-h - ^ * s l° cau y asymptotically most stringent, at i?o an d under 
correctly specified standardized radial density fx (an unrealistic assump- 
tion). Of course, even if fx were supposed to be known, Q^.^ still depends 

on the unspecified #,<7 2 ,Av and /3 2 , • • • ,/3 fc . In order to obtain a genuine 
test statistic, providing a locally asymptotically most stringent test at any 
t?o G (with an obvious abuse of notation) , we would need replacing those 
nuisance parameters with adequate estimates. We will not pursue any further 
with this problem here, as it is of little practical interest for arbitrary den- 
sity fx- The same problem will be considered in Section 5 for the Gaussian 
and pseudo-Gaussian versions of (4.2), then in Section 6 for the rank-based 
ones. 

4.2. Optimal parametric tests for eigenvalues. We now turn to the prob- 
lem of testing the null hypothesis Hq : Y^ k j= q +x ^j;V~PE^=i -\j';V = against 
alternatives of the form T~L^ : J2j= q +x Aj;V — pJ2j=x A?';V < 0, for given p G 
(0,1). Letting 

k I k k \ 

h:(X 2 ,X 3 ,...,X k yeC k - 1 ^ X i~P II A 7 1 + E A i 

j=q+l \j=2 j=2 J 

and recalling that Ilj=i ^i;V = 1, Hq rewrites, in terms of dvec(Av), as 
: /i(dvec(Av)) = 0, a highly nonlinear but smooth constraint on dvec(Av)- 
It is easy to check that, when computed at dvec(Av), the gradient of h is 

grad /i(dvec(Av)) 

= (p(Ai ; vA^V -!))••■ >P(Ai ; vA,"v - 1), 

1 +p(A 1; vA^ 1;V - 1), . . . , 1 +p(Ai ;V Afe.V - !))'• 

Here again, in view of the block-diagonal form of the information ma- 
trix, we may restrict our attention to the dvec(Av)-part A^.^ of the cen- 
tral sequence as if 8, a 2 and (3 were known; the parameter space then 
reduces to the (k — l)-dimensional open cone C k ~ l . Testing a nonlinear 
constraint on a parameter ranging over an open subset of is much 

easier however than the corresponding problem involving a curved experi- 
ment, irrespective of the possible noninvertibility of the information matrix. 
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In the noncurved experiment, indeed, a linearized version % A im : dvec(A-y-) £ 
dvec(Ao) +A / l ± (grad/i(dvec Ao)) of % A in the vicinity of dvec(Ao) satisfying 
/i(dvec Ao) = makes sense LM^(A) denotes the orthogonal complement of 
M(A)]. And, as mentioned in Section 3.3, under ULAN, the asymptotic be- 
havior of A^.y^, with $o = (0' ,<7 2 , (dvecAo)', (vec/3)')', is locally the same 
under % A lin as under T~Lq- As for the "linearized alternative" % A lin consisting 
of all dvec A values such that (dvec A — dvec Ao)'grad/i(dvec Ao) < 0, it lo- 
cally and asymptotically coincides with % A : indeed, although the symmetric 
difference 'H A A'% A 1 - , for fixed n, is not empty, any dvecAo + n _1 / 2 r /// € 
^llin eventually belongs to H A , and conversely. Therefore, a locally (at 
dvecAo) asymptotically optimal test for % A lin against Tif~\ in is also locally 
asymptotically optimal for % A against % A , and conversely, whatever the lo- 
cal asymptotic optimality concept adopted. Now, in the problem of testing 
y.Qll n against % A lin the null hypothesis is (locally) a hyperplane of 
with an alternative consisting of the halfspace lying "below" that hyper- 
plane. For such one-sided problems (locally and asymptotically, still at t9q) 
uniformly most powerful tests exist; a most powerful test statistic is [Le Cam 
(1986), Section 11.9] 



T<J := (grad'/ i (dvecA )(r^. /l )- 1 grad ^(dvec A ))~ 1/2 

(4.5) 

x grad / / l (dvecA )(r^ ;/i )- 1 A^ ;/i , 



(n) 

which, under P^.^ , is asymptotically standard normal. An explicit form of 

^i?o7i rec l urres a closed form expression of the inverse of = {Jk{fi)l 

k(k + 2)) x Dfc(Av). The following lemma provides such an expression for 
the inverse of D/c(Av) (see the Appendix for the proof). 

Lemma 4.1. Let P Av := I fc2 - ±A® 2 vec(Av 1 )(vec(A v 1 ))' and N fc := 
(P {k - 1)x ul k -i)- Then, (Dfc(Av))- 1 = N fc H fc P Av (I fc2 +K fc )Af (P Av )'H' fe N' fe . 

Using this lemma, it follows after some algebra that, for any #o G % A , 
grad' /i(dvec Ao)(Dfc(Ao)) _1 grad/i(dvec Ao) 

= 2 {p 2 E4o + ( 1 -f) 2 £ A 2 ;0 }=a M (A ) 

I j=l j=q+l ) 

[where A i— > a pq (A) is the mapping defined in (1.3)], and 

grad'/ l (dvecAo)(D fc (A ))- 1 M Ao H fe (A " 1/2 )® 2 =c; i(/ H fc (A^ /2 )^. 
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This and the definition of yields 

(4-6) itk = ( ! ^^) 1/ V M (Ao))" 1/2 <,dvec(Ay 2 /3'S^ /i /3Af ) 

with S^l defined in (4.3). The corresponding test, which rejects %q for 
small values of T^K , will be denoted as (p\; f . 

4.3. Estimation of nuisance parameters. The tests <^g^ an d </>a-/i de- 
rived in Sections 4.1 and 4.2 typically are valid under standardized radial 
density f\ only; they mainly settle the optimality bounds at given density f±, 
and are of little practical value. Due to its central role in multivariate anal- 
ysis, the Gaussian case (/i = 4>\) is an exception. In this subsection devoted 
to the treatment of nuisance parameters, we therefore concentrate on the 
Gaussian tests 4>q)^ and ^^-ii' to be considered in more detail in Section 5. 

The test statistics derived in Sections 4.1 and 4.2 indeed still involve 
nuisance parameters which in practice have to be replaced with estimators. 
The traditional way of handling this substitution in ULAN families consists 
in assuming, for a null hypothesis of the form # G Hq, the existence of 
a sequence &( n ' of estimators of $ satisfying all or part of the following 
assumptions (in the notation of this paper). 

Assumption (B). We say that a sequence of estimators (^ n ',n£N) 
satisfies Assumption (B) for the null T-Lq and the density f± if is: 

(Bl) constrained: pj*^ [¥ n) G Hq] = 1 for all n and all 

(B2) n 1 / 2 -consistent: for all G H , nV 2 (tf( n ) -&) = O p (1) under pj^, as 
n — > oo; 

(B3) locally asymptotically discrete: for all # G Hq and all c > 0, there exists 
M = M(c) > such that the number of possible values of in balls 
of the form {tin 1 / 2 !!^ — < c} is bounded by M, uniformly as 
n — > oo. 

These assumptions will be used later on. In the Gaussian or pseudo- 
Gaussian context we are considering here, however, Assumption (B3) can be 
dispensed with under arbitrary densities with finite fourth-order moments. 
The following asymptotic linearity result characterizes the asymptotic im- 
pact, on AjL and A^^, under any elliptical density g\ with finite fourth- 
order moments, of estimating $ (see the Appendix for the proof). 

Lemma 4.2. Let Assumption (A) hold, fix'&E® and g\ G J-f, and write 
Dk(gi) '■= ti>k+i;gi / fJ-k-i-.gi ■ Then, for any root-n consistent estimator # := 
{fr'M,* 111 '^')' of* under Pj^, both Af^- A™ M +a h {D k { gi )/k)x 
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r^y/ 2 (tf ffl - » UI ) and - A$k + ^(^(^O/^r^n^^ - 

■d IV ) are op(l) under P^) gi , as n— >oo, where a& was defined in Section 2.1. 

5. Optimal Gaussian and pseudo-Gaussian tests for principal components. 

5.1. Optimal Gaussian tests for eigenvectors. For fi = 4>i, the test statis- 
tic (4.2) takes the form 

( 5 -!) Q [ *L = -E(#4 re > ) 2 = "^sJJ^ ( i fc _ 

3=2 

with := ^SrE?=i v_1/2 ( x i - #)( x i - 0) / V~ 1 / 2 . This statistic still 

depends on nuisance parameters, to be replaced with estimators. Letting 
S^ n ) = ^ Ya=i ( Xi ~~ x )( x i ~~ x )'; a natural choice for such estimators would 

S (n) = .| S (»)|V*v = f|S^W=: (^)^^ v Av^, 

where Ay is the diagonal matrix collecting the eigenvalues of V (ranked 
in decreasing order), /3 V := (/3 1;V , . . . , /3 fc . v ) is the corresponding matrix 
of eigenvectors, and <r 2 stands for the empirical median of d?(X,V), i = 
1, . . . ,n. For f3, however, we need a constrained estimator (3 satisfying As- 
sumption (B) for Hq (/3 v hi general does not). Thus, we rather propose 
estimating i9 by 

(5.2) := (X', a 2 , (dvec A v )', (vecft,)')', 

where (3 := (f3°,/3 2 , ■ ■ ■ ,Pk) can be obtained from (fii-v, ■ ■ ■ ,Pk-v) Yia the 
following Gram-Schmidt technique. Let /3 2 := (Ife — /3 /3 )/3 2 -v/ II (■"■* ~~ 
/3 /3°')/3 2: vll • By construction, /3 2 is the unit-length vector proportional to 
the projection of the second eigenvector of S^ 71 ) onto the space which is 
orthogonal to /3°. Iterating this procedure, define 

~ g (It-^-EtS^^fev . = 3 , 

P] iKife-w-Ea^^vii' 3 

The corresponding (constrained) estimator of the scatter 5] is S := cr 2 V : = 

It is easy to see that /3 , under #0 £ %o > inherits /3 v 's root-re consistency, 
which holds under any elliptical density g\ with finite fourth-order moments. 
Lemma 4.2 thus applies. Combining Lemma 4.2 with (4.4) and the fact that 

Gf (} diag(I fc _i,0 (A ._2)(fc_i)/2x(fc-2)(fc-i)/2)(Gf )'vec(^ - (3 ) = 
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(where (3 is the matrix of eigenvectors associated with t?q), one easily 
obtains that substituting t9 for #o hi (5-1) has no asymptotic impact on 
Gtfotfi— more P recisel y' ~ is Ml) as n ^ oo under pJJ.^, 

with g\ 6 J 7 ^. It follows that shares the same asymptotic optimality 

properties as Q^.^, irrespective of the value of i?o G %o • Thus, a locally and 

asymptotically mosi stringent Gaussian test of Hq — denote it by <^]y — can 
be based on the asymptotic chi-square distribution [with (k — 1) degrees of 
freedom] of 

q(jo = ^^(^v- 1 / 2 sWv- 1 V) 2 

(5.3) =T^ L E^(^S(")/30) 2 
°" A i;V j=2 



,2|o(n)|2/fc *! , ^ 

""A7i(3'.S(»))3 ) 2 =:Q{ n) 



Since a 2 /|S^ n ^| 1//fc converges to as n — > oo under the null Hq and Gaussian 
densities, one can equivalently use the statistic 



k 



which, of course, is still a locally and asymptotically most stringent Gaussian 
test statistic. For results on local powers, we refer to Proposition 5.1. 

This test is valid under Gaussian densities only (more precisely, under ra- 
dial densities with Gaussian kurtosis). On the other hand, it remains valid in 
case Assumption (A) is weakened [as in Anderson (1963) and Tyler (1981, 
1983)] into Assumption (A^). Indeed, the consistency of X! remains unaf- 
fected under the null, and /3° still is an eigenvector for both 5] -1 / 2 and S, 
so that [I fe - /3°/3 0/ ]£~ 1/2 :£5r 1/2 /3 = 0. Hence, 

i=2 

^0/2-1/2^)2-1/2 (jyfp^ S-l/2 S Ws-l/2^0 



na,. 
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= nai / 3°'S- 1 /2 (s (n)_ a -i 5]) ^-i/2 

x [I k - /3°/3°']S~ 1/2 (S (n) _ o-ijjjs-VapO 
= nal^ 0/ E- 1 / 2 (sW-a^ 1 E)E- 1 / 2 

x [I fe - / g°^°']E- 1 /2(sW - a- 1 S)S- 1 / 2 /3° + o P (l), 

asn^oo under W^. Since n 1 / 2 o fc E -1 / 2 (SW - 1 E)S -1 / 2 )3 is asymp- 
totically M(0, I fc + (3° (3°') as n — )• oo under T^q-i anc ^ Gaussian densities, this 
idempotent quadratic form remains asymptotically chi-square, with (k — 1) 
degrees of freedom, even when (A) is weakened into (A^), as was to be shown. 

This test is also invariant under the group of transformations G TO t,o map- 
ping (Xi, . . . , X n ) onto (OXi + t, . . . , OX n + 1), where t is an arbitrary k- 

vector and O £ SO% := {O G SOk\O/3 = /3 }, provided that the estimator 
of $0 used is equivariant under the same group — which the estimator i9 pro- 
posed in (5.2) is. Indeed, denoting by Q^ } (0,t), 0(O,t), A s (0,t), /3(0,t), 
E(0,t) and S( n )(0,t) the statistics Q$ , tf, A s , /3, S and S( n ) computed 
from the transformed sample (OX 1 ^ + t, . . . , Ox["' + t), one easily checks 
that, for any O £ Sof , A s (0,t) = A s , /3(0,t) = 0/3, S(0,t) = OSO' 
and S(")(0,t) = OS ^O', so that (noting that O'j3 = (3°) 

k 

Q^ ) (O,t)=nal^(^.O'OE- 1 / 2 O'OS( ri )O / OS- 1 / 2 O / /3 ) 2 = Q^ ) . 
i=2 

Finally, let us show that ^Anderson anc ^ QjV asymptotically coincide, un- 
der "Hq-i anc ^ Gaussian densities, hence also under contiguous alternatives. 
This asymptotic equivalence indeed is not a straightforward consequence of 
the definitions (1.1) and (5.3). Since Y^j=2 ^7-s(^j;s(^'j;S ~ PjP'j) i s °p(1) an d 
n 1 / 2 (SW — j3 Asj3'o) is Op(l) as n — > oo, under Hq'i and Gaussian densities 
[with As := diag(Ai ; s, • • • , Afc ; g)], it follows from Slutsky's lemma that 

k 

^Anderson = 7 E A i^[(Aj;S - Ai ;S )/3j ;S /3°] 2 

Al;S^ 

= E X iiW s(n) - ^oAs^)/3 ] 2 + o P (l) 
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= T^E A rs(^sW/3°) 2 + OP (l) 

= 0# + op(i) 

as n — )• oo, still under 'Hq-i and Gaussian densities. The equivalence between 
^Anderson an< ^ Oat ™ ^ ne Gaussian case then follows since Qj^ = Q$ + 
op(l) as n — > oo, under and Gaussian densities. 



5.2. Optimal Gaussian tests for eigenvalues. Turning to , we now 
consider the Gaussian version of the test statistic T^K obtained in Sec- 
tion 4.2. In view of (4.6), we have 

(5-4) 4;L=- 1/2 (« P , 9 (Ao))" 1/2 < i9 dvec(Ay 2 /3'sW, i /3Af) 

[recall that Jk{<i>i) = k(k + 2); see (2.6)]. Here also we have to estimate #o m 
order to obtain a genuine test statistic. By using the fact that (3Ao(3' = Vo 
(where all parameter values refer to those in $o), we obtain that, in (5.4), 

n 1 /2 c ; gdvec( Ay 2 ^ s H ^ 

(5.5) 

1/2 / i n \ 

V^dvec /3'i£(X i -0)(X i -0)'/3 , 



a 2 p ' 9 \ 



a Op(l) expression which does not depend on Ao- In view of Lemma 4.2 
and the block-diagonal form of the information matrix, estimation of 6, a 2 
and /3 has no asymptotic impact on the eigenvalue part of the central 

sequence, hence on T^.. As for a Pj(? (Ao), it is a continuous function of 
Ao, so that, in view of Slutsky's lemma, plain consistency of the estimator 
of Ao is sufficient. Consequently, we safely can use here the unconstrained 
estimator 

(5.6) := (X / ,<7 2 ,(dvecA v ) / ,(vec^v) / ) / ; 

see the beginning of Section 5.1. Using again the fact that, under Gaussian 
densities, <r 2 /|S ( ' n - ) | 1 / fc converges to as n—>oo, a locally and asymptoti- 
cally most powerful Gaussian test statistic therefore is given by 

T (n) ;= ^V (ap?(Av)) -l/2 c ^ dvec( 3 Vs (n)^ v) 



0~ 



T) l/2„ \o(n)\l/k 

= " ak [° 1 (a p , g (A s ))- 1 / 2 < 9 dvec(^S ( "^v; 

(5.7) 
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/ k q \ 



V j=g+i j=i / 



+ o P (l), 

under Gaussian densities as n—>oo. The corresponding test, <ft A .jy say, re- 
jects Hq whenever Tffi is smaller than the standard normal a-quantile; (5.7) 
shows that T^' coincides [up to op(l)] with T^ dcrson given in (1.3), which 
entails that (i) ^^-Anderson * s a ^ so l° can y an d asymptotically most powerful 
under Gaussian densities, and that (ii) the validity of 4^j^ ex t en ds to Hq^ 
(since the validity of Anderson 

does). 

5.3. Optimal pseudo-Gaussian tests for eigenvectors. The Gaussian tests 
^flA/ an d ^aa/ °^ Sections 5.1 and 5.2 unfortunately are valid under multi- 
normal densities only (more precisely, as we shall see, under densities with 
Gaussian kurtosis). It is not difficult, however, to extend their validity to 
the whole class of elliptical populations with finite fourth-order moments, 
while maintaining their optimality properties at the multinormal. 

Let us first introduce the following notation. For any g± E J-f, let (as in 
Lemma 4.2) D k (g x ) := fik+im/^-i;gi = o- 2 E#. gi [d$(0, V)] = {G~ k l {u)) 2 du 
and E k ( 9l ) := a~^, gi [df(9, V)] = J^G^(u))Uu, where G lk (r) := 
(/ifc_ 1;gi ) -1 Jq s k ~ 1 gi(s) ds; see Section 2.1. Then 

k E k (gi) 
k + 2Dl( 9l ) 

is the kurtosis of the elliptic population with radial density gi [see, e.g., 
page 54 of Anderson (2003)]. For Gaussian densities, E k (4>i) = k(k -\-2)/a k , 
D k ((pi) = k/a k and K k {(f)i) = 0. 

Since the asymptotic covariance matrix of A^ >1 under P^) gi (with # E 

and g\ E Ff) is (a k E k (gi) / 'k(k + 2))r^ 1 , it is natural to base our 
pseudo-Gaussian tests on statistics of the form [compare with the fi = <j)\ 
version of (5.1)] 

( n ) ._ k(k + 2) IV/ IV p Q (Tt /3 y r iv p /3 wp/Wl \IV 

k 2 

= (i + Mgi))" 1 n2 / \ f Q&ti =: ^ + K fc(5'i))~ 1( 3i?oX(5i)- 

D ki9i)a k 

As in the Gaussian case, and with the same $ as in (5.2), Lemma 4.2 entails 
that qf™ = Q^o^j + o P (l), as n -)• oo under P^. fll , with i? E %q and 
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gi G J-f. Since a" 2 /|S^] 1//fc consistently estimates k/Dk(gi) under P^. gi) 
with t?o £ %o and <7i G Tf , it follows from Slutsky's lemma that 

0- ^ q(") 

^'•^ |S(")|V*aP*.^ 

satisfies = Q^j^f = Q# ,A/'(5 , i) + o p(1) as n — )■ oo, still under %q , 51 G Ff. 
The pseudo-Gaussian test 4>q-n* we propose is based on 

(5.8) g^a+^n 1 ^, 

where £ fc := {kn~ l £"=1 df)/(.(k + 2)( n - 1 Y? i=l djf) ~h with £ := S< ft >). 
The statistic Qyjr,, indeed remains asymptotically chi-square [(A; — 1) degrees 
of freedom] under Hq'i for any g± G J 7 ^. Note that 4^^* ls obtained from 

(n) 

rfj-jsf by means of the standard kurtosis correction of Shapiro and Browne 
(1987), and asymptotically coincides with <?^Tyier' see 0-^)- 

(n) 

Local powers for (fip-N* classically follow from applying Le Cam's third 

lemma. Let tW := ((t^WJ'.t^W, (t 111 ^)', (t iv ^)')', with t^M") uni- 
formly bounded, where t iv ^ = vec(b( n )) is a perturbation of vec(/3 ) = 
vec(/3 ,/3 2 ,..-,/3fc) such that /3' b, with b = (b[, . . . ,b' k )' := lim^oo b( n \ 
is skew-symmetric; see (3.2) and (3.3). Assume furthermore that the corre- 
sponding perturbed value of #0 G Hq does not belong to , that is, bi 7^ 0, 
and define 

r ^o;r : = ^S wech{n) )' Gl t diag(^" 2 1 , • • • , f u !,0i xM )(fc-i)/ 2 ) 

(5.9) x (Gf°)'(vecb (n) ) 

= 4 E-i} 1 (^b 1 ) 2 . 

i=2 

The following result summarizes the asymptotic properties of the pseudo- 
Gaussian tests 4>^\f^ • Note that optimality issues involve Hq [hence require 

Assumption (A)], while validity extends to Hq'i [which only requires As- 
sumption (A^)]. 

Proposition 5.1. (i) Qjtfl is asymptotically chi-square with (k — 1) de- 
grees of freedom under U^g-^ 3 ' U gi eTf{P ( 'frg 1 } > an d asymptotically noncen- 
tral chi-square, still with (k — 1) degrees of freedom, but with noncentrality 
parameter rf. T /4(l + K k (g{)) under ^ +n -i/2 T ( n) , gi , with & e H%, gi G T^, 
and as described above; 
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(ii) the sequence of tests 4>^\f^ rejecting the null whenever Q^fl exceeds 
the a upper- quantile xl— x-i—a °f ^ e chi-square distribution with (k — 1) 
degrees of freedom has asymptotic size a under U# e ^' V} gi ^r A ^'^g 1 ^ > 

(iii) the pseudo- Gaussian tests (^j^ are asymptotically equivalent, un- 
d er U^g-^/ 3 ' {P^jl an d contiguous alternatives, to the optimal parametric 
Gaussian tests (p0j^> hence, the sequence ^g//^ is locally and asymptoti- 
cally most stringent, still at asymptotic level a, for U^ e ^' U gi e.F4{P^ i } 
against alternatives of the form U^^^iP^^}- 

Of course, since kk is invariant under <7 ro t,o, the pseudo-Gaussian test 
inherits the £/ ro t ]0 -invariance features of the Gaussian one. 

5.4. Optimal pseudo- Gaussian tests for eigenvalues. As in the previous 
section, the asymptotic null distribution of the Gaussian test statistic 
is not standard normal anymore under radial density g\ as soon as Kk(gi) ^ 

K/c((/>i). The Gaussian test </>a-!v ^ nus * s no * van d (does not have asymptotic 
level a) under such densities. The same reasoning as before leads to a similar 
kurtosis correction, yielding a pseudo-Gaussian test statistic 

where := n 1 /2( ap (? (A s ))-i/2 ((1 -p)£*^ +1 A j;S -p£*=i A j;S ) and k k 

is as in Section 5.3. This statistic coincides with T^ vis given in (1.4). 

Here also, local powers are readily obtained via Le Cam's third lemma. Let 
r (n) . = (( T /(n)y 5T //(n) ; (jUl{n)y ^ ^.lV{n)yy ^ with r (n)/ T (n) uniformly bounded, 

where t 111 ^ := dvec(l( n )) is such that 1 := lim^oo l( n ) := lim^oo diag^i^, 

. . . ,t k ) satisfies tr(A^- x l) = [see (3.1) and the comments thereafter], and 
define 

k q 

(5.10) rj£:= lim grad/i(dvec(A v ))V /// ( n ) = (l-p) lj-pj^lj- 

i=?+i j=i 

The following proposition summarizes the asymptotic properties of the re- 

(n) 

suiting pseudo-Gaussian tests .JW 

Proposition 5.2. (i) is asymptotically normal, with mean zero 

under \J#eH£.% U ffl e^{ P ^gi}' mean ( 4a P,g( A v)(l + «fe(ffi))~ 1/2 ^ under 

^ > ^+ n -i/2 T (n). fl ' ^ ^ ^o"' ffl ^ Fa an d T ^ as described above, and variance 
one under both; 
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(ii) the sequence of tests 4>\-n* rejecting the null whenever Tj^l is less 
than the standard normal a-quantile z a has asymptotic size a under 

(iii) the pseudo-Gaussian tests ^kIn* are asymptotically equivalent, un- 
der {Pj}^ } and contiguous alternatives, to the optimal parametric 
Gaussian tests (p/^Kf; hence, the sequence ^A-iv* is locally and asymptoti- 
cally most powerful, still at asymptotic level a, for Ui?eH A " U eT 4 ^ 3 ^ } 
against alternatives of the form U^-n^Ptf 1 ^} ■ 

6. Rank-based tests for principal components. 

6.1. Rank-based statistics: Asymptotic representation and asymptotic nor- 
mality. The parametric tests proposed in Section 4 are valid under specified 
radial densities f\ only, and therefore are of limited practical value. The im- 
portance of the Gaussian tests of Sections 5.1 and 5.2 essentially follows 
from the fact that they belong to usual practice, but Gaussian assumptions 
are quite unrealistic in most applications. The pseudo-Gaussian procedures 
of Sections 5.3 and 5.4 are more appealing, as they only require finite fourth- 
order moments. Still, moments of order four may be infinite and, being based 
on empirical covariances, pseudo-Gaussian procedures remain poorly robust. 
A straightforward idea would consist in robustifying them by substituting 
some robust estimate of scatter for empirical covariance matrices. This may 
take care of validity-robustness issues, but has a negative impact on powers, 
and would not achieve efficiency-robustness. The picture is quite different 
with the rank-based procedures we are proposing in this section. While re- 
maining valid under completely arbitrary radial densities, these methods 
indeed also are efficiency-robust; when based on Gaussian scores, they even 
uniformly outperform, in the Pitman sense, their pseudo-Gaussian counter- 
parts (see Section 7). Rank-based inference, thus, in this problem as in many 
others, has much to offer, and enjoys an extremely attractive combination 
of robustness and efficiency properties. 

The natural framework for principal component analysis actually is the 
semiparametric context of elliptical families in which 9, dvec(Av), and (3 
(not a 2 ) are the parameters of interest, while the radial density / [equiv- 
alently, the couple (cr 2 ,/i)] plays the role of an infinite-dimensional nui- 
sance. This semiparametric model enjoys the double structure considered in 
Hallin and Werker (2003), which allows for efficient rank-based inference: 
the fixed- /i subexperiments, as shown in Proposition 3.1 are ULAN, while 
the fixed-(#, dvec(Av), (3) subexperiments [equivalently, the fixed-(#,V) 
subexperiments] are generated by groups of transformations acting on the 

observation space. Those groups here are of the form Gglr, ° and consist of 
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(n) 

the continuous monotone radial transformations Q\ 

e< ft) (x 1} . . . ,x n ) = Q { h n \e + di(e, vjv^UiCe, v), . . . , 

+ d n (0,V)V 1 /2 Un (0,V)) 
— (fl + ^di^V^vVaUi^V),..., 
+ / l (d n (0,V))V 1 / 2 U n (6>,V)) J 

where /t:M + — >■ M + is continuous, monotone increasing, and satisfies 
linv^oo h(r) = oo and h(0) = 0. The group Gg%, generates the fixed-(#, V) 

family of distributions I J_ 2 L J * {P^n , ° , . , ,„ , }. The general results 

wo- wj! i 0,t7^,dvcc(Av),vcc(/3);/i J 

of Hallin and Werker (2003) thus indicate that efficient inference can be 
based on the corresponding maximal invariants, namely the vectors 

(i?J n) (0, V), ... , 4") (0, V), Ui(0, V), . . . , U„(0, V)) 

of ranks and multivariate signs, where 12^ (0, V) denotes the rank of di (6, V) 
among di(0, V), . . . , d n (9, V). Test statistics based on such invariants auto- 
matically are distribution-free under I I 2 U* |p1™^o , o , . > „ }■ 

wcr J l 0.CT^,dvec(Av),voc(/3);/i J 

Letting ifc := Ri(8,V) and U 4 := Ui(0, V), define 



and 



A lll 



A IV 



1 / n / \ 

^mJvh 4 (a v 1 /V)«'E^(^ i )™c(u,u; 

1=1 



2^GfLj- A v ( V* 2 )-V^ vec(U,U0. 

i=l 

Associated with A^? K and A.^ K , let 



A 

and 



:= ^ G " L "' AV ( V " 2 )" 1/2 E^(^{^^)) vec(U 4 Uj), 

where Gifc is as in Section 5.3. The following proposition provides an asymp- 
totic representation and asymptotic normality result for A^/^ and A^ 1 ^-. 



Proposition 6.1. Let Assumption (S) hold for the score function K. 
Then: 
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(i) (asymptotic representation) (Ag, A{^)' = (Ag^, A^ igl )' + 

Oj,2 (1) as n-^oo, under P#? gi , f or an V # £ ® and g\ € T\ ; 

(ii) (asymptotic normality) let Assumption (A) hold and consider a bound- 
ed sequence r( n ) := ((t 7 ^)', t 11 ^, (t 777 ^)', {t iv{ ^)')' such that both t 111 := 
lim^ooT 777 ^ andr IV :=\im n ^ 00 T Iv ^ exist. Then (A 777 £ ffl , A 7 ^)' is 
asymptotically normal, with mean zero and mean 



Jk(K, 9l ) ( D fc (A v )r 



111 



k(k + 2) V 3 G f diagCi/u 1 , . . . , i/^CGf )'r IV 
[where Jk(K,g\) was defined in (2.5)], under P^) gi (any # G and gi € 
J 7 !^ and P^ n _i/2 T (») # £ @ «nd 5i G -?aj. respectively, and block- 

diagonal covariance matrix diag(T^f K , T^( K ) under both, with 

r m ._ ^7"fc(-^) / A s 
- fc(^) Dfc(Av) 

and 

(fU) r ^ := i^fk G ^ diagK ~ 21 ' • • • ' <-d* )(g * y - 

The proofs of parts (i) and (ii) of this proposition are entirely similar to 
those of Lemma 4.1 and Proposition 4.1, respectively, in Hallin and Pain- 
daveine (2006a), and therefore are omitted. 

In case K = K$ x is the score function associated with f\ G J a , and pro- 
vided that Assumption (A) holds (in order for the central sequence A,?.^ of 

Proposition 3.1 to make sense), A 7 / 7 ^. ^ and A^Y^ ^, under P^^ clearly 

coincide with A 7 / 7 ,- and A^Y* . Therefore, A 7 / 7 ^ and AfJC constitute 

j J 1 ),/ 1 ' Jl ' /l 

rank-based, hence distribution-free, versions of those central sequence com- 
ponents. Exploiting this, we now construct signed-rank tests for the two 
problems we are interested in. 

6.2. Optimal rank-based tests for eigenvectors. Proposition 6.1 provides 
the theoretical tools for constructing rank-based tests for %q and computing 
their local powers. Letting again $o := (0',a 2 , (dvec Ay)', (vec/3 )')', with 
(3 = (/3°,/3 2 , . . . define the rank-based analog of (4.2) 

(6.2) fc 

_ nfc(fe + 2) ^ , (n) 2 

where S& := sE2=i^(^8r^)Ut(0,V)Uj(0,V). 
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In order to turn Q^. K into a genuine test statistic, as in the parametric 

case, we still have to replace #o with some adequate estimator # satisfying, 
under as large as possible a class of densities, Assumption (B) for Hq. In par- 
ticular, root-n consistency should hold without any moment assumptions. 
Denote by #hr the Hettmansperger and Randies (2002) affine-equivariant 
median, and by Vx y i Gr the shape estimator of Tyler (1987), normalized 
so that it has determinant one: both are root-n consistent under any ra- 
dial density g\. Factorize Vxyicr into /3 Tyler AT y i C r/9Tyicr- The estimator we 
are proposing (among many possible ones) is i9 = (0' HR , cr 2 , (dvec Axyier)', 
(vec/3 o yy> where the constrained estimator f3 := (/3°,/3 2 ) • • • i@k) is con- 
structed from /3Tyier y i a the same Gram-Schmidt procedure as was applied 
in Section 5.1 to the eigenvectors /3 V of V := S^/|S^ n ^| 1 / fc ; note that a 2 
does not even appear in Q^ yK , hence needs not be estimated. 

In view of (6.2), 



( n ) _ n( n ) 



QW = Q 



-3) = ^^-^f^h ~ /3°/3°')S^/3 



where the ranks and signs in S^. K are computed at that is, R\ 

R^\d UR ,P A Tylci p' ) and Uj = U 4 (0 H R, /9 A Ty icr3o). 

Let us show that substituting i9 for tDq in (6.2) has no asymptotic impact 
on Q^. K — that is, Q^. K - Q£ = °p(1) as n-» oo under P^> ;fll , with g 1 G 

Jo, £ Hq'i ■ The proof, as usual, relies on an asymptotic linearity property 
which, in turn, requires ULAN. The ULAN property of Proposition 3.1, 
which was motivated by optimality issues in tests involving (3 and Ay , here 
cannot help us, as it does not hold under Assumption (A^). Another ULAN 
property, however, where Assumption (A) is not required, has been obtained 
by Hallin and Paindaveine (2006a) for another parametrization — based on 
(0, a 2 , V) — of the same families of distributions, and perfectly fits our needs 
here. 

Defining J*. := (veclfc)(veclfc)' and := 1^2 — ^ Jjfc, it follows from Propo- 
sition A.l in Hallin, Oja and Paindaveine (2006) and Lemma 4.4 in Kreiss 
(1987) that, for any locally asymptotically discrete [Assumption (B3)] and 
root-n consistent [Assumption (B2)] sequence (0W,VW) of estimators of 
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location and shape, one has 



(6.4) + 



4k(k + 2) 
= op(l) 



2 

I fc 2 + K fc - -3 k 



( V - 1 /2)®2 n l/2 vec( y(n)_ v) 



as n — >• oo under P# . ffl j with #o £ © an d 9i ^ ^a- This result readily applies 

to any adequately discretized version of (0hr> /So-^Tyier/^o) a * #o £ %oi- It is 
well known, however, that discretization, although necessary for asymptotic 
statements, is not required in practice [see pages 125 or 188 of Le Cam and 
Yang (2000) for a discussion on this point]; we therefore do not emphasize 
discretization any further in the notation, and henceforth assume that i?, 
whenever needed, has been adequately discretized. 

Using (6.4) and the fact that [/3 01 ® (I fe - P°/3°')]J k = 0, we obtain, 
under P^. fll with i?q G T~Lq'i an d Q\ € -^a, since Kfc(vec A) = vec(A') for 
any k x k matrix A and since (3° under i?q G %o i i s an eigenvector of 
V-V^At^&V-Vs, 

v^[/3 ' (I fe - /3°/3 ')] vec(S - S $ K ) 



Jk(K,gi) 
2k(k + 2) 

+ op(l) 



7i 



s (Ifc _ /3 o /3 o 0](v -i/2 ) ® 2 vec( ^ o A Tyler ^ _ V ) 



+ op(1) 
= o P (l), 

as n — > oo. In view of (6.3), we therefore conclude that — Q^. K = op(l) 

as n — > oo, still under under #o G %o-i' as was to be shown. 
The following result summarizes the results of this section. 

Proposition 6.2. Let Assumption (S) hold for the score function K . 
Then: 

(i) Q£ is asymptotically chi-square with (k — 1) degrees of freedom un- 
d er U.# g -^' Ug 1 eJ r a {^frg 1 } > an< ^ as y m Pt°ti' ca tty noncentral chi-square, still 
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with (k — 1) degrees of freedom, and noncentrality parameter 

Ak{k + 2)J k {K) 

under P^ n _i/ 2x („). 9 > for "d G T~Lq and g\ 6 J- a , with as in Proposi- 

tion 5.1 and r^. T defined in (5.9); 

(ii) the sequence of tests 4$k rejecting the null when exceeds the a 

upper- quantile of the chi-square distribution with (k— 1) degrees of freedom 
has asymptotic size a under \J^ en /3i{J gie jr a {P^ gi } ; 

(iii) for scores K = Kj 1 , with f\ £ T a , 4>^ K is locally asymptotically most 
stringent, at asymptotic level a, for {J^^pr {J gi eF a {Pfr gi } against alterna- 



tives 



of the form U^ H p{^- h }- 



Being measurable with respect to signed-ranks, is asymptotically 

invariant under continuous monotone radial transformations, in the sense 
that it is asymptotically equivalent (in probability) to a random variable 
that is strictly invariant under such transformations. Furthermore, it is easy 
to show that it enjoys the same £/ ro t,o-invariance features as the parametric, 
Gaussian, or pseudo-Gaussian test statistics. 

6.3. Optimal rank-based tests for eigenvalues. Finally, still from the re- 
sults of Proposition 6.1, we construct signed-rank tests for the null hypothe- 
sis %q. A rank-based counterpart of (4.5) and (4.6) [at t?o = (6' , a 2 , (dvec Ao)', 
(vec/3)')' € is, writing V for (3A (3', 

T { Z,K = (grad / / l (dvecA )(r^ ;/ ,)- 1 grad/ i (dvecA ))- 1 / 2 
(6.5) x grad / / l (dvecAo)(r^ ;i ,)- 1 A^. K 



nk(k + 2) 
Jk(K) 



y /2 (o p , 3 (Ao))-V 2 c; )g dvec(A; / Vs£ i ,/3A^ 2 ). 



Here again, we have to estimate i?o- Note that, unlike the quantity 
grad' /i(dvec Ao)(r^.^ i )~ 1 A^.^ appearing in the Gaussian or pseudo-Gaus- 
sian cases, grad'/i(dvec Ao)(r^. ftr )~ 1 A^. A - does depend on Ao [see the 
comments below (5.4)]. Consequently, we have to carefully select an estima- 
tor ■& that has no influence on the asymptotic behavior of T^. K under T-L^'q- 
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To this end, consider Tyler's estimator of shape VTyier(=: /^Tyier ^Tyier/^Tyier' 
with obvious notation) and define 

dvec(A Tyler ) := (I fe - Cp^c^Cp^ -1 ^ J(dvec A Tylcr ). 

Then the estimator of shape Ay := AT y ier/|ATyier| 1//fc is clearly constrained: 
c' (dvec Ay) = and |Av| = 1- The resulting preliminary estimator ft is 

(6.6) ft := (0' RR ,a 2 , (dvec Ay)', (vec^^)')', 

where (?hr still denotes the Hettmansperger and Randies (2002) affine- 
equivariant median. The test statistic we propose is then 

Ik —l#. K 

= (grad'/i(dvec A v )(r^)^ grad/i(dvec A v ))~ 1/2 

(6.7) x grad^(dvecA v )(r^)- 1 A^ 

j k{K) ) (<W A v)) 1/2 c' M 

X dvec(Av /2 ^T y lcrS^^ T ylerAv /2 ), 

where sg, := ± E?=i K^ti^ with := i^(0 H R ^Tyler^'iyier) 

and Uj := Uj(0HR 5 /^Tyicr-^-V/flTyicr)- The following lemma shows that the 
substitution of ft for ft in (6.6) has no asymptotic effect on T^ K (see the 
Appendix for a proof). 

Lemma 6.1. Fix ft e H^i and g\ 6 J- a , and let ft be the estimator 
in (6.6). Then T^ — T$) K is op(l) as n— >oo, under P^) . 

The following result summarizes the results of this section. 

Proposition 6.3. Let Assumption (S) hold for the score function K . 
Then: 

(i) is asymptotically standard normal under {J^^ah \J gi ^ a {P^ gi } > 

and asymptotically normal with mean 

Jk{K, gi) Av 

^4k(k + 2)a, M (A v )J k (K) r »> T 



RANK-BASED TESTS FOR PCA 37 



as in 



and variance 1 under P^. n -i/2 T (n) , g > w ^ & £ %c^ v > 9\ £ Fa, T ^ 
Proposition 5.2, and defined in (5.10); 

(ii) the sequence of tests (jyJ^K rejecting the null whenever Tjc is less 
than the standard normal a-quantile z a has asymptotic size a under 



(iii) for scores K = Kf 1 with /i £ J , the sequence of tests 



is 



locally and asymptotically most powerful, still at asymptotic level a, for 
UtfeH^U^ej-jP^J against alternatives of the form U^w^P^}. 

7. Asymptotic relative efficiencies. The asymptotic relative efficiencies 
(AREs) of the rank-based tests of Section 6 with respect to their Gaussian 
and pseudo-Gaussian competitors of Sections 5 are readily obtained as ratios 
of noncentrality parameters under local alternatives (squared ratios of stan- 
dardized asymptotic shifts for the one-sided problems on eigenvalues). De- 
noting by ARE^ {4>i 14^2 ) * ne ARE, under local alternatives of the form 
P^ n -i/2 T . Sl ' °f a sequence of tests with respect to the sequence <j>^ \ 
we thus have the following result. 

Proposition 7.1. Let Assumptions (S) and (B) hold for the score func- 
tion K and (with the appropriate null hypotheses and densities) for the es- 
timators i9 described in the previous sections. Then, for any g± 6 



k(k + 2)J k (K) 

Table 1 provides numerical values of these AREs for various values of the 
space dimension k and selected radial densities g± (Student, Gaussian and 
power-exponential), and for the van der Waerden tests <^^ dW and ^A-vdW 

the Wilcoxon tests 4^Ki an< ^ ^Jfi' anc ^ * ne Spearman tests 4>q-k2 an< ^ 
4a- K 2 (* ne score functions K a , a > were defined in Section 2.2). These val- 
ues coincide with the "AREs for shape" obtained in Hallin and Paindaveine 
(2006a), which implies [Paindaveine (2006)] that the AREs of van der Waer- 
den tests with respect to their pseudo-Gaussian counterparts are uniformly 
larger than or equal to one (an extension of the classical Chernoff-Savage 
property): 

inf ARE^(^i dW /^J = in;AREf; i( ^") dW /^%) = 1. 



38 M. HALLIN, D. PAINDAVEINE AND T. VERDEBOUT 

Table 1 

AREs of the van der Waerden (vdW), Wilcoxon (W) and Spearman (SP) rank-based 
tests (j>p) K and (j)J^ K with respect to their pseudo-Gaussian counterparts, under 

k- dimensional Student (with 5, 8 and 12 degrees of freedom), Gaussian, and 
power-exponential densities (with parameter rj — 2,3,5), for k = 2, 3, 4, 6, 10, and k — > oo 



Underlying density 



K 


k 


is 


ts 


*12 




e 2 


e 3 


e 5 


vdW 


2 


2.204 


1.215 


1.078 


1.000 


1.129 


1.308 


1.637 




3 


2.270 


1.233 


1.086 


1.000 


1.108 


1.259 


1.536 




4 


2.326 


1.249 


1.093 


1.000 


1.093 


1.223 


1.462 




6 


2.413 


1.275 


1.106 


1.000 


1.072 


1.174 


1.363 




10 


2.531 


1.312 


1.126 


1.000 


1.050 


1.121 


1.254 




OC 


3.000 


1.500 


1.250 


1.000 


1.000 


1.000 


1.000 


W 


2 


2.258 


1.174 


1.001 


0.844 


0.789 


0.804 


0.842 




3 


2.386 


1.246 


1.068 


0.913 


0.897 


0.933 


1.001 




4 


2.432 


1.273 


1.094 


0.945 


0.955 


1.006 


1.095 




6 


2.451 


1.283 


1.105 


0.969 


1.008 


1.075 


1.188 




10 


2.426 


1.264 


1.088 


0.970 


1.032 


1.106 


1.233 




oo 


2.250 


1.125 


0.938 


0.750 


0.750 


0.750 


0.750 


SP 


2 


2.301 


1.230 


1.067 


0.934 


0.965 


1.042 


1.168 




3 


2.277 


1.225 


1.070 


0.957 


1.033 


1.141 


1.317 




4 


2.225 


1.200 


1.051 


0.956 


1.057 


1.179 


1.383 




6 


2.128 


1.146 


1.007 


0.936 


1.057 


1.189 


1.414 




10 


2.001 


1.068 


0.936 


0.891 


1.017 


1.144 


1.365 




oo 


1.667 


0.833 


0.694 


0.556 


0.556 


0.556 


0.556 



8. Simulations. In this section, we investigate via simulations the finite- 
sample performances of the following tests: 

(i) the Anderson test <^ Anderson' ^ ne optimal Gaussian test (p^\f, the 

pseudo-Gaussian test <fip/j>/*, the robust test 4>frTy\ eT based on Q^yi er , and 

various rank-based tests (f>a) K (with van der Waerden, Wilcoxon, Spearman 
and sign scores, but also with scores achieving optimality at t%, and t§ 
densities), all for the null hypothesis Hq on eigenvectors; 

(ii) the optimal Anderson test <^Anderscm = ^A-iv ^ e pseudo-Gaussian 
test 0^.^* = ^A-Davis Dase d on T^ vis , and various rank-based tests 4>^ K 
(still with van der Waerden, Wilcoxon, Spearman, sign, t±, £3 and scores), 
for the null hypothesis on eigenvalues. 

Simulations were conducted as follows. We generated N = 2500 mutually 
independent samples of i.i.d. trivariate (k = 3) random vectors ee : j, I = 
1, 2, 3,4, j = 1, . . . , n = 100, with spherical Gaussian (eiy), £5 (s2;j), £3 (^3;j) 
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(10 









H 





4 









V o 





U 





and t\ (e4 ; j) densities, respectively. Letting 

'cos(vr£/12) -sin«/12) 0' 
sin(</12) cos«/12) 
1 

and 

/3f o N 
L 5 := 
\ 0, 

each ei-j was successively transformed into 

(8.1) X (:/;i \i.A 2 e l:i . £ = l,2,3,4,j = l,...,n,£ = 0,...,3, 
and 

(8.2) Y, ;i;C = (A + L | ) 1 / 2 e, ;i , £ = 1, 2, 3, 4,j = 1, . . . ,n,£ = 0, . . . ,3. 

The value £ = corresponds to the null hypothesis : j3 l = (1, 0, 0)' for the 
X^.j.^'s and the null hypothesis Hq : Ylj= q +i ^j;v/Ylj=i -\?';V = 1/3 (with 
<7 = 1 and A; = 3) for the Y^.j.^'s; £ = 1, 2, 3 characterizes increasingly distant 
alternatives. We then performed the tests listed under (i) and (ii) above in 
N = 2500 independent replications of such samples. Rejection frequencies 
are reported in Table 2 for T{q and in Table 3 for Ti^. 

Inspection of Table 2 confirms our theoretical results. Anderson's 0^ Anderson 
meets the level constraint at Gaussian densities only; <^xyicr (equivalently, 

further survives the t§ but not the t$ or t\ densities which have infi- 
nite fourth-order moments. In contrast, the rank-based tests for eigenvectors 
throughout satisfy the nominal asymptotic level condition (a 95% confidence 
interval here has half- width 0.0085). Despite the relatively small sample size 
n = 100, empirical power and ARE rankings almost perfectly agree. 

The results for eigenvalues, shown in Table 3, are slightly less auspicious. 
While the Gaussian and pseudo-Gaussian tests remain hopelessly sensitive to 
the violations of Gaussian and fourth-order moments, respectively, the rank 
tests, when based on asymptotic critical values, all significantly overreject, 
indicating that asymptotic conditions are not met for n = 100. We therefore 
propose an alternative construction for critical values. Lemma 6.1 indeed 
implies that the asymptotic distribution of the test statistic Tjs (based 

on the ranks and signs of estimated residuals) is the same, under P^L^ 
$0 £ as that of K (based on the ranks and signs of exact residu- 

als, which are distribution- free) . The latter distribution can be simulated, 
and its simulated quantiles provide valid approximations of the exact ones. 
The following critical values were obtained from M = 100,000 replications: 
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Table 2 

Rejection frequencies (out of N = 2500 replications), under the null and increasingly 
distant alternatives (see Section 8 for details), of the Anderson test ^^Anderson) ^ e Tyler 
test </^xyier> the parametric Gaussian test ^a-M' its pseudo- Gaussian version 4>a-jis*> and 
the signed-rank tests with van der Waerden, t v (v= 1, 3, 5), sign, Wilcoxon, and 
Spearman scores, ^j^.^dw ^S'ti ? ^s-W' ^jS-SP' respectively. Sample size is 

n = 100. All tests were based on asymptotic 5% critical values 



Test 01230123 



N t 5 



An) 

^(3; Anderson 





0572 





3964 





,8804 





9852 





,2408 





,4940 





,8388 





,9604 




0. 


,0528 





,3724 





,8568 





,9752 





,2284 





,4716 





,8168 





,9380 


V/3;Tylcr 





0572 





3908 





,8740 





9856 





0612 





,2520 





,6748 





,8876 







,0524 





,3648 





,8512 





9740 





,0544 


0, 


,2188 


0, 


,6056 





,8156 


An) 





,0368 





2960 





,8032 





9608 





,0420 


0, 


2328 


0, 


6908 





,9056 







,0452 





,3204 





,8096 





9596 





,0476 


0, 


,2728 


0, 


,7440 





,9284 


?/3;*i,3 





,0476 





,3104 





,7988 





9532 





,0496 


0. 


,2760 


0, 


7476 





,9280 


6 {n) 





,0488 





,2764 





,7460 





9220 





,0552 


0, 


,2652 


0, 


7184 





,9024 


r/3;S 





,0448 





,2268 





,6204 





8392 





,0496 


0, 


2164 


0. 


6236 





,8324 







,0456 





,3144 





,8012 





9556 





,0484 


0, 


,2808 


0, 


,7464 





9320 







,0444 





3096 





,8160 





,9576 





,0464 


0. 


,2548 


0. 


,7068 





9152 










t 3 
















h 










^{3; Anderson 


0. 


4772 





6300 





,8532 





9452 





,9540 





,9580 





9700 





,9740 







,4628 





,6040 





,8304 





9168 





9320 





,9384 





9472 





,9480 


^Tylcr 





,0892 





,2248 





,5364 





7508 





,5704 


0, 


,5980 


0, 


,6584 





,7444 







0616 





,1788 





,4392 





6092 





,4516 


0, 


4740 


0, 


5160 





,5624 


An) 

?3iv(iw 





,0444 





,2172 





,6464 





,8676 





,0472 


0, 


1656 


0, 


5104 





,7720 


<f> (n) 





,0488 





,2628 





,7120 





9076 





,0560 





2100 


0, 


,6068 





,8508 


?/3;*i,3 





,0500 





,2728 





,7156 





9116 





,0576 


0, 


2156 


0, 


,6292 





,8672 







,0476 





,2688 





.7100 





9084 





,0548 





,2256 


0, 


6600 





,8856 


<b (n) 

Z/3;S 





,0492 





,2202 





,6188 





8352 





0512 


0, 


2116 


0, 


6172 





,8448 


I/3;W 





,0520 





,2708 





,7136 





9120 





,0552 


0. 


2148 


0, 


6148 





,8604 


™/3;SP 





,0544 





2436 





,6648 





8776 





,0580 


0. 


,1824 


0, 


,5200 





,7740 
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Table 3 

Rejection frequencies (out of N = 2500 replications), under the null ri A and increasingly 
distant alternatives (see Section 8), of the optimal Gaussian test cj>f^\f = 4>a- Anderson > ^ s 

pseudo- Gaussian version <j>^N'* = ^jVDavis' an< ^ ^ e signed-rank tests with van der 
Waerden, t u (v — 1, 3, 5), sign, Wilcoxon, and Spearman scores ^y^idW Va-^ > ^yvs' 

4>s?-W' Sample size is n — 100. All tests were based on asymptotic 5% critical 

values and (in parentheses) simulated ones 



Test 











1 




2 






3 














M 










4"k = 

An) 

VA;vdW 


A; Anderson 

An) 

VA^Davis 


0.0460 
0.0432 
0.0608 


(0.0480) 


0.4076 
0.3976 
0.4604 


(0.4116) 


0.8308 
0.8220 
0.8576 


(0 


.8280) 


0.9604 
0.9572 
0.9668 


(0.9596) 


L 1,5 




0.0728 


(0.0480) 


0.4804 


(0.3972) 


0.8572 


(0. 


8116) 


0.9644 


(0.9504) 


6 (n) 

?A;t I>3 




0.0748 


(0.0496) 


0.4804 


(0.3884) 


0.8524 


(0. 


,7964) 


0.9612 


(0.9432) 






0.0780 


(0.0504) 


0.4532 


(0.3572) 


0.8160 


(0 


,7320) 


0.9448 


(0.9112) 






0.0864 


(0.0508) 


0.3980 


(0.3088) 


0.7384 


(0. 


,6408) 


0.9028 


(0.8552) 


<A (n) 




0.0744 


(0.0480) 


0.4816 


(0.3908) 


0.8544 


(0. 


,8012) 


0.9640 


(0.9464) 






0.0636 


(0.0460) 


0.4664 


(0.4096) 


0.8564 


(0 


,8200) 


0.9668 


(0.9584) 


Q A;J\[* - 

An) 

?A;vdW 


A; Anderson 

,(n) 

^AiDavis 


0.1432 
0.0504 
0.0692 


(0.0548) 


0.4624 
0.2768 
0.4256 


(0.3772) 


0.7604 
0.5732 
0.7720 


(0 


,7404) 


0.9180 
0.7988 
0.9444 


(0.9324) 






0.0736 


(0.0492) 


0.4544 


(0.3772) 


0.7980 


(0. 


7372) 


0.9524 


(0.9332) 


<A (n) 

?A;t 1>3 




0.0732 


(0.0452) 


0.4576 


(0.3748) 


0.7968 


(0. 


,7320) 


0.9524 


(0.9288) 


?A;tx ! 




0.0776 


(0.0416) 


0.4448 


(0.3484) 


0.7832 


(0 


,6952) 


0.9436 


(0.9116) 


?A;S 




0.0768 


(0.0436) 


0.4060 


(0.3172) 


0.7180 


(0. 


,6360) 


0.9100 


(0.8592) 


?A;W 




0.0732 


(0.0456) 


0.4512 


(0.3756) 


0.7972 


(0. 


,7364) 


0.9524 


(0.9308) 


?A;SP 




0.0764 


(0.0544) 


0.4360 


(0.3736) 


0.7776 


(0 


,7304) 


0.9480 


(0.9300) 



— 1.7782 for van der Waerden, —1.8799 for is-scores, —1.8976 for t3-scores, 

— 1.9439 for ti-scores, —1.9320 for sign scores, —1.8960 for Wilcoxon and 

— 1.8229 for Spearman. Note that they all are smaller than —1.645, which is 
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Table 3 
( Continued.) 



Test 











1 




2 






3 












ts 












Pa-X* - 

6 (n) 

j_ A;vdW 


«,(«) 
A; Anderson 

^A;Davis 


0.2572 
U.UoOo 
0.0708 


(0.0560) 


0.5308 

n 1 7QQ 
U.l (OO 

0.4088 


(0.3260) 


0.7200 

U.o I U4 
0.7540 


(0. 


.7040) 


0.8596 

U.04OD 

0.9304 


(0.9140) 


L A ;il,5 




0.0812 


(0.0544) 


0.4472 


(0.3524) 


0.7936 


(0. 


.7240) 


0.9416 


(0.9208) 


r A ;*i,3 




0.0832 


(0.0560) 


0.4556 


(0.3568) 


0.7944 


(0. 


,7256) 


0.9452 


(0.9192) 
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0.0924 


(0.0548) 


0.4464 


(0.3400) 


0.7812 


(0. 


.7024) 


0.9364 


(0.8996) 


j_ A;S 




0.0936 


(0.0604) 


0.4104 


(0.2928) 


0.7320 


(0. 


.6404) 


0.9012 


(0.8528) 


<f> (n) 

fA;W 




0.0832 


(0.0572) 


0.4488 


(0.3580) 


0.7956 


(0. 


7272) 


0.9448 


(0.9180) 


?A;SP 




0.0796 


(0.0576) 


0.4212 


(0.3412) 

ti 


0.7572 


(0. 


.7020) 


0.9276 


(0.9044) 


•PaiAT. - 

-(») 

?A;vdW 


«,(«) 
A; Anderson 
An) 

fAiDavis 


0.7488 
0.0072 
0.0724 


(0.0596) 


0.8000 
0.0080 
0.3500 


(0.3032) 


0.8288 
0.0172 
0.6604 


(0 


.6176) 


0.8528 
0.0296 
0.8600 


(0.8332) 


?A;t!,5 




0.0824 


(0.0512) 


0.3836 


(0.3120) 


0.7312 


(0 


.6492) 


0.9036 


(0.8664) 


?A;*i,3 




0.0828 


(0.0532) 


0.3936 


(0.3108) 


0.7488 


(0. 


,6644) 


0.9168 


(0.8776) 


?A;*i,i 




0.0864 


(0.0532) 


0.4088 


(0.3104) 


0.7612 


(0. 


,6720) 


0.9264 


(0.8832) 


<f> (n) 
r A ;S 




0.0920 


(0.0556) 


0.3896 


(0.3028) 


0.7336 


(0 


,6488) 


0.9092 


(0.8564) 


<f> (n) 
?A;w 




0.0824 


(0.0524) 


0.3872 


(0.3072) 


0.7376 


(0. 


,6552) 


0.9108 


(0.8728) 


?A;SP 




0.0752 


(0.0588) 


0.3536 


(0.2992) 


0.6648 


(0. 


,6064) 


0.8604 


(0.8220) 



consistent with the overrejection phenomenon. The corresponding rejection 
frequencies are reported in parentheses in Table 3. They all are quite close 
to the nominal probability level a = 5%, while empirical powers are in line 
with theoretical ARE values. 

APPENDIX 

We start with the proof of Proposition 3.1. To this end, note that although 
generally stated as a property of a parametric sequence of families of the 
form = {PL"V € O} (n € N), LAN (ULAN) actually is a property 
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4.3 



of the parametrization ui \— > , u> G Q of (i.e., of a bijective map 
from CI to V^ 1 ). When parametrized with u := (6', (vech £)')'j <^ G := 
R fc x vech(5fc), where 5^ stands for the class of positive definite symmetric 
real k x k matrices, the elliptical families we are dealing with here have 
been shown to be ULAN in Hallin and Paindaveine (2006a), with central 
sequence 



=i 



(A.l) A 



(n) ._ 



with cij = dj(#,S) and Uj = Ui(0,E), where P' k denotes the duplication 
matrix [such that P^ vech(A) = vec(A) for any k x k symmetric matrix A]. 

The families we are considering in this proposition are slightly different, 
because the ^-parametrization requires k identifiable eigenvectors. However, 
denoting by Q B := ~R k x vech (S B ), where S B is the set of all matrices in Sk 
compatible with Assumption (A), the mapping 

c!:lj = (0',(vech£)')' €ft B 

d(u) := (0', (det E) 1 /*, (dvec A s )'/(det E) 1 ^, (vec 0)')' G 
from the open subset Q B of ]^ fc + fc ( fc + 1 )/ 2 to is a differentiable mapping 
such that, with a small abuse of notation, P, „ * = P^ \ t and P&> = 

P^a-i^y^ (with (f -1 defined on only). The proof of Proposition 3.1 
consists in showing how ULAN in the u;-parametrization implies ULAN 
in the i9-parametrization, and how the central sequences and information 
matrices are related to each other. Let us start with a lemma. 

(n) 

Lemma A.l. Let the parametrization u i— > P^ , u> G fi, where fl is 
an open subset of R kl be ULAN for V^ n) = {P^| uj G ft}, with central se- 
quence and information matrix T^. Let d : u> \— > # := d(u>) be a con- 
tinuously differentiable mapping from to R fc2 (k2 > k\) with full col- 
umn rank Jacobian matrix Dd(u) at every u> . Write := d(Cl), and as- 
sume that > P^ 1 , i9 G provides another parametrization ofV^. Then, 
$ i — y Pq, i? G is also ULAN, with [at # = d{u>)] central sequence A^ = 

{D~ 5(u))'Au fl^rf information matrix T$ = (D~ d(u))'T jD~ d(u) , where 
D~ d{u) := ((Dd(u>))' Dd(u>))~ 1 (Dd(u>))' is the Moore-Penrose inverse of 
Dd(u). 

Proof. Throughout, let i9 and u be such that i9 = d(u). Consider i9 G 
and an arbitrary sequence i?^ n ^ = i9 + C^n" 1 / 2 ) G 0. The characterization 
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of ULAN for the i?-parametrization involves bounded sequence t^} £ R fc2 
such that the perturbation i9^ n ^ + n~ l / 2 T^ still belongs to 0. In order 
for $( n ) + n~ l / 2 T^ to belong to 0, it is necessary that be of the 
form rl™^ + o(l), with rl™^ in the tangent space to at &( n \ hence of 
the form + o(l) with in the tangent space to at i9, that is, 
tW = Dd{uj)w^ for some bounded sequence w^ n ' G R 1 . It follows from 
differentiability that, letting = 5 _1 (tf (n) ), 

M + „-V2 T W = 0^ + ^/^(^(n) + oin- 1 ' 2 ) 

= 3( w ( n )) + rr^lMwW") + o^" 1 / 2 ) 

(A.2) 

= (Z(w (n) ) + n- 1 / 2 D3(cj( n V (n) + o(n~ 1/2 ) 

Hence, turning to local logdikelihood ratios, in view of ULAN for the u- 
parametrization, 

log ( dp i n i) +n - 1/2T (:)/ dP it)) 

= wW'A^, - iwW'r w wW + 0P (1) 
under P^o = P^i) -probability, as n — > oo. Now, the LAQ part of ULAN for 

(n) 

the #-parametrization requires, for some random vector A^(„) and constant 
matrix 

(A.4) log(<i )+n _ 1/2Tl V<t)) = tWa% - \t^T»t^ + op(1) 

under the same P^ n) = P^(i) probability distributions with, in view of (A.2), 
T ($ = Dd(u)w^ + o(l). Identifying (A. 3) and (A.4), we obtain that LAQ 

(n) 

is satisfied for the i?-parametrization, with any A^ satisfying 

(A.5) (Dd(u))' A^ = AL n) . 

Now, let tj be the ith. column of Dd(u), i = 1, . . . , ki, and choose t^+i, . . . , 
tfc 2 £ R fc2 in such a way that they span the orthogonal complement of 
M{Dd(u)). Then {tj, i = 1, ...,fc2} is a basis of R fc2 , so that there exists 
a unique A: 2 -tuple • • ■ ,^ k J such that A^ } = E 4 =i With this 

notation, (A.5) yields 

A&> = (Dd(u)YA^ 
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i=l i=l 



where we let 
this entails (i) A, 




hence Ajj 1 = (D~d(u>))'Au . As a linear transformation of A^ , Ajj 1 
clearly also satisfies the asymptotic normality part of ULAN, with the de- 
sired rv □ 

The following slight extension of Lemma A.l plays a role in the proof 
of Proposition 3.1 below. Consider a parametrization u = (u' a ,u' b )' i-> P« , 
u> € fl x V, where fi is an open subset of and V C K m is a ^-dimensional 
manifold in R m , and assume that it is ULAN for -p (n) = {P^V G n x 
V}, with central sequence A^ and information matrix r u . Let <3 a be a 
continuously differentiable mapping from M. kl to M fe2 (k 2 > &i) with full 
column rank Jacobian matrix Dd a (u a ) at every cj a , and assume that $ : = 

3(w) ^Pf'.iJeexV [with := 5 a (^)], where 

5:fixV — >0xV a; = (aj a ,aJ6y h-> &{u) = (4(w a ),Wi,)' 
provides another parametrization of T 7 ^- Then the proof of Lemma A.l 

(n) 

straightforwardly extends to show that i? i— >■ P^ , i? 6 x V is also ULAN, 

still with [at 1? = 3(u;)] central sequence A^ l) = (D-afw))'^ and infor- 
mation matrix r# = (D~ ' d(u)))'T U} D~ d(uj). 

Proof of Proposition 3.1. Consider the differentiable mappings d\ : 



w := (0', (vech £)')' ^ = (0', (dvec A E )', (vec/3)')' and 3 2 : = (0', 

(dvec A s )', (vec/3)')' d 2 (d l (u)) = (0',cr 2 , (dvec A v )', (vec/3)')' G 0, the lat- 



ter being invertible. Applying Lemma A.l twice (the second time in its "ex- 
tended form," since the /3-part of the parameter is invariant under d 2 ) then 



In view of the definition of M fc v (Section 3.2), the Jacobian matrix, com- 
puted at 1?, of the inverse mapping 1 is 



yields 




{Dd 2 (d l {u)))'- l Dd 1 {u){{Dd l {u))'Dd 1 {u))- 1 A t 

(Ddz\d(u))) , Dd 1 (u;)((Dd 1 (u)yDd 1 (u)))- 1 A£ 




dvec(Av) ct 2 (m£ v )' 
OK: 
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An explicit expression for Dd\{u) was obtained by Kollo and Neudecker 
[(1993), page 288]: 



Ddx(oj) 

(A.6) 



l\®2 



with UfaKn ■-- 



/3i®[)9(Ai iE I fc -A S j)-)3 / ] 
\)9 , fc ®D3(A fciS I fc -A E )-^/ 



The result then follows from a direct, though painful, computation, using 
the fact that 

(PfcE^E^P'fe)- 1 = (P' fe )-(/3®/3) diag(Z 11;S ,Z 12;S , . . . ,l kk ;j;)(P' ®(3')P- , 

with = 1 if i = j and = (Aj ; s — Aj ; s) -2 if i 7^ j; (Pfc) - here stands 
for the Moore-Penrose inverse of [note that (Pi) - is such that 
Pl(Pl) — vec(A) = vec(A) for any symmetric matrix A]. □ 

Proof of Proposition 3.2. Proceeding as in the proof of Lemma A. 1 , 
let Vj be the ith column of Dh(£ ), i = 1, . . . , m, and choose v m +i, . . . , v p £ 
MP spanning the orthogonal complement of M(Dh(^ )). Then there exists a 
unique p-tuple (<5# o; i, . . .,^ o;p )' such that A# = Ya=i ^o;» v « (since Vj,« = 
l,...,p spans MP) and 

p m 

A^ = DH'(S )A* =Y,So ;iDti(£ )v l =J2 S #o;iDti(£ )vi 
i=i i=i 

(A.7) 

= C r! (£ )A£ , 

where C fi (£ ) := Dh'(£ Q )Dh(£ ) and A^ Q := (<fo o; i, . . . , <fo o;m )'. Hence, we 
also have = Cfi(^ )r^ o Cs(^ ), where T^ Q is the asymptotic covariance 

matrix of A^ Q under P^- Using the fact that C^(^ ) is invertible, this 
yields 

Q €o := (AX)'C s (^ )(C,(^ )r^C r! (^ ))- 1 C r! (^ )AS i 

-(AS t )'C R (eo) J Dn«o)pf(«o)C R (e )rS t C fi (| ) J Df(ao))- 1 
x Df (a )C R (£ )A% 

= (A^ )'(rx)- 1 AS l 
-(AX) , (rx)- 1/2 n((r^jV2 C ^ ( ^ o)Dr(Qo))(rslo) -i/2 A: n o 

where II(P) := P(P'P)~ 1 P / denotes the projection matrix on Ai(P). 
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Let t>:A C R e ->■ W be a local (at # ) chart for the manifold C n ©, 
and assume, without loss of generality, that r] = 5~ 1 (i?o)- Since Dh(£ ) has 
maximal rank, it follows from (A. 7) that A$ = Dh(^ )A^ l . Hence, the 
statistic 

(A.8) Q# := K a (T^ o - ^6(r 7o )( J D5 / (7 ?0 )r^D5(r 7o ))- J D6 / ( % ))A 1?0 
[the squared Euclidean norm of the orthogonal projection, onto the lin- 

1 12 

ear space orthogonal to 1^ Dh(r) ), of the standardized central sequence 

1 12 

(r^ )~ A^ ] can be written as 

Q» = (A%jDhXt )(Dh(Z )T% Dh'(Z )rDh(Z )A% 

-(A^) / i5fi'^o)I>6(»7o)Pe'(»7o)^«o)r%I>^«o)£>6(»?o))" 
x Dd'(r, )Dh(t )A% 

= (A%jDh\Z )(Dh(t )TZDhXt ))-Dh(Z )A% 

-(A%y(r%)-^n((r%)^^(€ )j36(t| ))(r%r^A% 
= : Qi?o,i ~~ Q&o,2- 

Since Dh(£ ) has full rank, the standard properties of Moore-Penrose 
inverses entail Q^ ,i = Q# ,i- As for Q^ ,2 and (3i9 ,2j they are equal if 

^((rx) 1/2 c«(| )£»t(ao))=^((rX) 1/2 ^ / (4o)^5M)- 

Since r# and Cft(£ ) are invertible, the latter equality holds if M(Dt(eto)) = 
M((C n (€ ))- 1 Dh'(€ )Dd(<q )), or, since Dh(£ ) has full rank, if 

M(DH(Z )Dt( ao )) =M(Dh(^)(C h (^)r 1 Dh\^)Dd(r 1o )) 

(=M(IL(Dh(Z ))Dd(r, ))), 

which trivially holds true. Hence, Q^ ,2 = Q& ,2i so that Q^ Q = Q& - 

1/2 1 /2 

Eventually, the linear spaces orthogonal to 1^ Dt>{r)o) and to Dh(r] ) 
do coincide, so that the statistic Q-o , which is obtained by substituting 5 
for 5 in (A.8), is equal to Q& (=Q£ )- This establishes the result. □ 

We now turn to the proofs of Lemmas 4.1 and 4.2. 

Proof of Lemma 4.1. The proof consists in checking that postmul- 
tiplying D fc (Av) with N fc H fc P Av (I fc2 + K fc )Af 2 (P Av )'H' fc N' fc yields the 
(k — l)-dimensional identity matrix (P Av and are defined in the state- 
ment of the lemma). That is, we show that 

lM Av H fc (I fe2 + K fc )(A v 1 )^H' fc (M Av )'N fe H fc 
(A.9) x P A v(I fc2 +K k )Af (P Av )'H' fe N' fc 
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First of all, note that the definition of (see Section 3.2) entails that, 
for any k x k real matrix 1 such that tr(A^l) =0, (M Av )'N fe H fe (vecl) = 
(M Av )'(dvecl) = dvec(l) = Hfc(vecl). Hence, since (letting Ejj := e^e^ + 

P Av (I fe2 + K fc ) = I fe2 + K k - ^Af vec(A- 1 )(vec(A v 1 ))' 



k 

^ vecl 

»J=1 



i(A v 1 )yA v VvecEy) / 



= :^(vecF Av )(vecE ii )', 

with tr(AY 1 Fy V ) = 0, for all i, j = 1, . . . , k, we obtain that (M Av )'N fc H fc x 
P Av (I fc2 + K fe ) = H fc P Av (I fc2 + K fc ). Now, using the fact that 
H^Av 1 )®^ + K k )W k = (Ay 1 )® 2 ^ + K k )H' k , the left-hand side 
of (A. 9) reduces to 

(A.10)lM^H fc (I fc2 + K fe )(A v 1 )® 2 Pfv(I fc2 + K fe )Af 2 (P A v)'H' fc N' fc . 

After straightforward computation, using essentially the well-known prop- 
erty of the Kronecker product vec(ABC) = (C <g> A) vec(B) and the fact 
that M Av x ^(vecA^ 1 ) = and U k K k = H fc , (A. 10) reduces to 

M Av H fe H' fe Nfc. The result follows, since H fc H fc = I k and M Av Nj, = I fc _x. 

□ 

Proof of Lemma 4.2. All stochastic convergences in this proof are as 
n — > oo under ~P^) gi , for some fixed i3e9 and g\ G J-f . It follows from 

(A.ll) M Av H fc 09')® 2 (V- 1 )® 2 vec V = M Av H fc (vec A v *) = 
and 

(A. 12) Lf Av (V- 1 )® 2 vec V = Lf' Av (vec V- 1 ) = 0, 

that 

A ^ = ^MjvH^A- 1 ^')® 2 



E^^vec(U,,(0,V)U:(0,V)) 



x 

a 

i=l 



a k 



r M Av H fc 03')® 2 CV- 1 ] 



2^a 2 k 

n 

J>ec((X 4 - 0){Xi - 9)' - {D k { 9l )/k)T.) 



x 
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and 

IV _ a k n /3 r /3,Ay/ A r-l/2\®2'S 



A 



_^ G ^ L AA V (y _ 1/2)8 2 £ vec(Uj(0) v)u / (0! v)) 



2Vncr 2 



g p l p,v v(v - 



n 

X 

i=l 



^vec((x, - e)(x< - ey - (D k ( gi )/k)v). 



Hence, using a root-n consistent estimator $ := (6', a 2 , (dvec A-y)', (vec/3)')' 
and letting 5] := a 2 j3A\-p , Slutsky's lemma yields 



^vec((X i - 0)(X; - 9)' - (D k ( gi )/k)±) 
2MfH fc (^') 02 (V-f 2 



x 

i=i 



x J ^vec((X 4 - 0)(X< - 0)' - (DfcOa)/fc)E) 

U=i 

- nvec((X - 0)(0 - 0)') - nvec((0 - 6>)(X - 6)') 
+ n vec((0 - 6){6 - 6)') - n{D k { gi )/k) vec(S - £) j 



2ka 2 

+ o P (l), 
and, similarly, 



2^Ea 2 k k 1 ' 

n 

x ^vec((X l - 0)(X; - BY - (D k ( gi )/k)±) 

i=l 

+ op(l). 
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Writing S — S = (a 2 — <7 2 )V + <r 2 (V — V), applying Slutsky's lemma again, 
and using (A. 11), (A. 12) and the fact that Kfcvec(A) = vec(A'), we obtain 



A^ = A^ - ^^M^H k (PT\V~Y 2 n 1/2 vec(V - V) 
(A.13) 



2k 

+ op(l) 



= - ^^MfvH.^O^CV" 1 )^^ + K fe ] 
x n 1/2 vec(V- V) +o P (l) 

and 

A^=A^ - ^ll G ?Lf A v ( v-i)«V/ 2 vec(V-V) + 0p (l) 

(A.14) = - ^^Gf Lf • ^(V 1 )® 2 ^ + K fc ]nV 2 vec(V - V) 

+ op(l). 

Now, Kollo and Neudecker (1993) showed that 

where was defined in (A. 6). Similar computations as in the proof of 

Proposition 3.1 then yield 

n^vecOV"- V) 

1/2/,=,/ ~ /dvec(A v - A v )\ . m 

= B /( fiMvflftAv ) J+0P(1) 

(A.15) 

= (Lf Av )'(Gf)'n 1 / 2 vec( / 3-/3) 

+ /3 02 H> 1 / 2 dvec(A v - A v ) + op(l). 
The result for A^ then follows by plugging (A.15) into (A.13) and using 

the facts that H fc (/3')® 2 (V" 1 )® 2 (Lf Av )' = and ?i 1 / 2 dvec(A v - A v ) = 
n 1 / 2 (M Av )'dvec(Av — Av) + op(l) as n — > oo (the latter is a direct conse- 
quence of the definition of M Av and the delta method). As for the result 
for A^ , it follows similarly by plugging (A.15) into (A.14) by noting that 

GfLf ^(V" 1 )® 2 ^ +K k ]f3® 2 H> k = 0. □ 

PROOF of Lemma 6.1. Throughout fix i9 = (9', a 2 , (dvec A)', (vec/3)')' G 
"H A q and gi S J a , and define V := /3 Tylcr Av/9' Ty i cr . Since vec(A) = vec(A') 
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and c' Ptq H k j3'^ ei (V 1 / 2 )® 2 (yecI k ) = 0, we obtain, from (6.4), 



(A.16) 



Jk(K) 
Jk{K) 



/2\<g>2 



4k(k + 2)J k (K) 

x (V- 1 / 2 )®V/ 2 vec(V-V) + o P (l) 

as n — > oo, under Pj^ • 

We now show that the second term in (A.16) is op(l) as n — > oo, under 
Since n 1 / 2 vec(V — V) is Op(l), Slutsky's lemma yields 

(a M (Av))- 1 / 2 < ig H fc ^ r (vV2)^ (v -i/2 r 2 n i/2 vec( v_ v) 

= MAy^H^n 1 / 2 vec(V - V) + o P (l). 

By construction of the estimator Ay, c' p g Hfc/3!^ 2 er (vec V) = 0, so that we 

have to show that n 1 / 2 ^ ^H^ vec( / 9^ ryler Vy9 Tyler ) is op(l). We only do so 
for 1? values such that Ai ; v = • • • = X q -y =: AJ > A2 := A g +i ; v = • • • = Afc ; v, 
which is the most difficult case (extension to the general case is straightfor- 
ward, although notationally more tricky). Note that the fact that 1? £ H A q 
then implies that 

(A.17) - pq x* + (l- p )(k-q)\* 2 =0. 

Partition E := /3'/3 Tylcr into 

(A.18) E=f^ J? 12 

where En is q x q and E22 is (k-q)x(k-q). As shown in Anderson [(1963), 
page 129] n 1 /2( Ell E' n - I q ) = o P (l) = n l / 2 {V 22 E' 22 - I fc _ g ) and nV 2 E 12 = 
Op(l) = n 1 / 2 E' 21 as n — > 00, under P^!^ [actually, Anderson (1963) proves 
this only for E = /3'/3g and under Gaussian densities, but his proof readily 
extends to the present situation]. Hence, still as n — > 00, under P^^j 

reV2c p,g H fc VeC (^Tylcr V /3Tylcr) 

= -pin 1 / 2 XI trCE'nEn) + n^ 2 X* 2 tr(E 21 E 2 i)} 
(A.19) + (1 - p){n^ 2 X\ tr(E' 12 E 12 ) + n l ' 2 X* 2 tr(E 22 E2 2 )} 
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= - P {n^ 2 Xltr(I q )} + (1 - pHn 1 ' 2 ^ tr(I fc _ ff )} +op(l) 

= op(i); 

see (A. 17). We conclude that the second term in (A. 16) is op(l), so that 

(n) _ / nk(k + 2) \ 1/2 
~^ " V Jk{K) ) [a ™ [Av)) C ™ 

><H,^ y L(V 1/2 ) 02 J^vec(S^) 
+ op(l) 

= v^rJ (0m(Av)) c ™ 

x HfcE" g,2 (/3 / )® 2 (V 1 / 2 )® 2 J^- vec(S^"^-) 
+ op(l). 

Since n 1 / 2 J^ - vec(S^!^) is Op(l) under P^^, Slutsky's lemma entails 

(n) _ / nk(k + 2) \ 1/2 1/2 
~^ " V Jfc(A') J {a ™ [Av)) C p* 

xH fc (diag(E' n ,E 22 ))^ 2 (/3')® 2 
x(vV2)^ vec (s^) + p(l) 

(A.20) 

fnk(k + 2)\ 1/2 1/2 , 

xH fc (diag(E' 11 ,E 22 ))^ 2 (/3')^ 2 
x(vV2)^ V ec(S^) + 0p (l), 

where we used the facts that S^ K is Op(l) and that 

nV^c^H^diaglEi^E^))^ 2 ^) 02 ^ 1 / 2 )^ 2 ^^) 
= n l ' 2 {-p\\ trCEnE'n) + (1 - p)\* 2 tr(E 22 E' 22 )} 
= n x ' 2 {-p\\ tr(I 9 ) + (1 - p)X* 2 tr(I fc _ ff )} + o P (l) 
= o P (l). 
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Then, putting [with the same partitioning as in (A. 18)] 

P V **;K V P - - I f („) , f („) , I ' 

the asymptotic properties of S^ K and Ejj, j = 1,2 imply that 

iP = (^^) 1/2 (a M (Av))- 1 / 2 {-ptr(E' 11 (D^) 11 E 11 ) 

+ (1 -p)tr(E' 22 (D^) 22 E 22 )} + 0P (1) 
! ^f) 1/2 (a M (Av))- 1/2 {-ptr((D^) 11 ) 

+ (l-p)tr((D^) 22 )} + 0p (l) 

= zSi + op(i) 

as n — > oo, under P^.^, which establishes the result. □ 

Proof of Proposition 6.2. Fix # G "Hq-'i an ^ 9i G -^V We have al- 
ready shown in Section 6.2 that Q$ — K = op(l) as n — > oo under 

P^. Sl - Proposition 6.1 (i) then yields 

n (n)_ A IV> u T IV \- _t>P° (CpP°\i V IV p0V(P /3 VlA 7V 

(A.21) 

+ op(1), 

still as n — )• oo under P^. ffl • Now, since 

= ^ G fc°diag(Ifc_i,0( fc _ 2 )( A .„i) /2x ( fc „ 2 )( fc „i)/ 2 )(Gf )' 

is idempotent with rank (k — 1) [compare with (4.4)], it follows that 

is asymptotically chi-square with (k — 1) degrees of freedom under P^. gi , 
which establishes the null- hypothesis part of (i). For local alternatives, we 
restrict to those parameter values #o G T~i^ for which we have ULAN. From 
contiguity, (A.21), also holds under alternatives of the form ,, 
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Le Cam's third lemma then implies that under P " , 1/9 M > is 

asymptotically noncentral chi-square, still with (fc — 1) degrees of freedom, 
but with noncentrality parameter 

lim {(r Iv ^y 

n— >oo 

Evaluation of this limit completes part (i) of the proof. 

As for parts (ii) and (iii), the fact that 4>^) K has asymptotic level a di- 
rectly follows from the asymptotic null distribution just established and the 
classical Helly-Bray theorem, while asymptotic optimality under K$ x scores 

is a consequence of the asymptotic equivalence, under density f\ , of Qx f 
and the optimal parametric test statistic for density f\. □ 

Proof of Proposition 6.3. Fix # e and gi G F a . It directly 
follows from Lemma 6.1 and Proposition 6.1 that 

TP = (grad , / l (dvecA^)(r^.^)- 1 grad/ l (dvecA^))- 1 / 2 

x grad / / l (dvecA^)(r^)" 1 A^ gi + 0P (1) 

as n — > oo, under P^. ffl , hence also — provided that i?q € T~Lq — under the con- 
tiguous sequences P^ +n _i/ 2T („). ffi - Parts (i) and (ii) result from the fact that 
^Sq K gi * s as y m PtoticaHy normal with mean zero under P^. gi and mean 

lim {J k (K, 9l )/(k(k + 2))D fe (Av)r /// W} 

n— >oo 

under P^ +n -i/2 T (n). fll (Le Cam's third lemma; again, for -&q G T~Lq), and 

with covariance matrix J7fc (if + 2))Dfc(Av) under both. Parts (iii) 

and (iv) follow as in the previous proof. □ 
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